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Preface 


This textbook has evolved from a set of lecture notes which I pre- 
pared for a semester course in Hilbert space at the State University 
of Iowa. In both the course and the book, I have in mind first- or 
second-year graduate students in Mathematics and related fields such 
as Physics. 

It is necessary for the reader to have a foundation in advanced 
calculus which includes familiarity with: least upper bound (LUB) 
and greatest lower bound (GLB), the concept of function, «’s and 
their companion 4’s, and basie properties of sequences of real and 
complex numbers (convergence, Cauchy’s criterion, the Weierstrass- 
Bolzano theorem). It is not presupposed that the reader is acquainted 
with vector spaces (which are discussed in Chapter I), matrices 
(which are mentioned only in a few exercises), or determinants (which 
are mentioned nowhere else in the book). In keeping with these pre- 
requisites, 1 have refrained from using set-theoretic notation in the 
first two chapters; some of the notations and terminology of set theory 
are gradually introduced thereafter. Occasionally I have inserted re- 
marks and exercises, not essential to the exposition, which call for con- 
siderably more background than the general level of the text. Such 
statements can be recognized by the phrase “‘it can be shown”; they 
are likely to be out of reach of the reader’s technique, but not out of 
reach of his curiosity. 

There are over four hundred exercises, most of them easy. Some of 
these describe generalizations of material in the text. Others provide 
examples and counter-examples for the definitions and theorems of 
the text. Any of them may be used for practice in the construction 
of proofs. Hints for solution, and references to other sources, will be 
found in the Appendix. Exercises in the “it can be shown” category 
are marked with a *. The reader who prefers to vault over the exer- 
cises may do so without breaking the continuity of the exposition, 
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The “‘spectral theorem” presented here is confined to completely 
continuous normal operators, the most elementary infinite case, This 
is not enough for many applications, but it is about as far as one can 
get without sophisticated techniques. Many beautiful expositions of 
advanced spectral theory are available to the interested and prepared 
reader (a few of these are listed in the Appendix). 

It is my hope that this book, aside from being an exposition of 
certain basic material on Hilbert space, may also serve as an intro- 
duction to other areas of functional analysis. It seems to me that 
Hilbert space, fortified with the experience and geometrical intuition 
developed in two- and three-dimensional cases, is an excellent vehicle 
for mtroducing topics such as vector spaces, metrie spaces, Banach 
spaces, Banach algebras, and so on. Such opportunities are exploited 
frequently in the text and in the exercises. 

The system of bookkeeping I have used is as follows. Each chapter 
is organized into sections (§). Within each §, Definitions, Examples, 
Theorems, and Exercises are numbered starting at 1. For example: 
Theorem VI.8.1 is the first theorem in §8 of Chapter VI; within a 
chapter, Theorem 3.2 refers to the second theorem in § 3 of that chap- 
ter; within a ὃ, Theorem 8 refers to the third theorem in that ὃ. The 
end of a proof is indicated by the Halmos finality symbol ἢ; I have 
also used this symbol in a few other places to indicate a pause before 
a new train of ideas begins. J 

I am grateful to my colleagues Lloyd A. Knowler and William T. 
Reid, and to the staff of the Oxford University Press, for encouraging 
me to bring forth this book. 

S. K. B. 
September 1961 
Iowa City 
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§ 1. Complex vector spaces 
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§ 4, Linear combinations of vectors 
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§ 6. Linear independence 
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81. COMPLEX VECTOR SPACES 


Underlying every Hilbert space, there is a vector space; the present 
chapter contains preparatory material on vector spaces. The reader 
who is already acquainted with the basic theory of vector spaces can 
pass directly to Chapter II, for there is nothing in the present chap- 
ter which is particularly oriented toward Hilbert space. 

In the sequel, complex numbers will also be referred to as scalars, 


Definition 1. A vector space Ὃ is a set of objects x,y,z,--- called vec- 
tors. One vector is distinguished, called the zero vector, and denoted 0. 
For each vector x, there is distinguished a vector —z, called the negative 
of x. The following axioms are assumed to hold. 
(A) For each pair of vectors x,y, there ts determined a vector called 
the sum of x and y, denoted x + y (read “x plus y’”). The “addt- 
tion” of vectors is subject to these rules: 


(Al) e+ty=yt2 (commulative law) 
(A2) 2+ (y+2) =(@+y) +2 (associative law) 
(A3) «<+0=2 


(A4) 2+ (—2) Ξ θ. 
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(M) For each scalar \, and each vector x, there is determined a vector 
called the multiple of x by ἃ (or the product of ἃ and x), denoted 
Az. The “multiplication” of vectors by scalars is subject to these 
rules: 
(M1) λί  ψὴ =Ar+dry 
(M2) A+ μὴ) = dr + μα 
(M3) (λμ)Σ = A(x) (associative law) 
(M4) lz =z. 

The reader will observe that in (M2), the symbol + is used in two 
senses: for the sum of scalars, and the sum of vectors. In (M3), the 
juxtaposition of letters has two meanings: the product of two scalars, 
or the product of a scalar and a vector. 


(distributive laws) 


Examples 


1. Given a fixed positive integer n. Let U be the set of all symbols 
x = (Ay,***,An), called n-ples, where );,--+,An are scalars. If 2 = 
(A1,°**;An) and y = (y,--*,#n), write «= y in case dy = μα for 
k = 1,---,n;, is called the k’th component of x. Define @ = (0,---,0), 
—x = (--τλι," 5, τλὴ}, ὦ Ἔν = Ar +a, ++,An + un), and Ar = 
(Ady,"**,AAn). The axioms (A) and (Μὴ are verified by using the 
given criterion for equality, and the analogous properties of sums and 
products of scalars. For instance, if ἃ = (Ay,:-*,An) and y = 
(41,°*:,H#n), the relations Ay + we = μα - λα (kK = 1,-+-,n) yield 
x+y = y +2. In the sequel, this example will be referred to as the 
vector space of n-ples. 


2. Let Ὃ be the set of all functions x defined on a set 3, having 
scalar values. Write x = y in case x(t) = y(t) for all ὁ in 3. Define the 
functions 0, —2, x + y, and Ax by the formulas 


a(t) = 0 
(—z2)(t) = —2(¢) 
@+yO Ξ χ(ἢ + yO 
(Ax) (t) = Ax(d). 


This example will be referred to as the vector space of scalar-valued 
functions on 5. 
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One can obtain Hxample 1 as a special case: take for 5 the set of 
integers 1,---,n, and interpret = (A;,---,An) as the function on 3 
such that x(k) = dx. If 5 is the set of all positive integers 1,2,3,---, 
then U can be interpreted as the set of all sequences x = (Ax) with 
scalar terms dz (k = 1,2,3,---); in this context, two sequences are 
considered to be equal if they are equal term by term. 


3. Let Ὃ be the set of all scalar-valued functions x defined, and con- 
tinuous, on the closed interval [a,b]. Write x = y in case z(t) = y(@) 
for all ἡ, a Ξ ὁ < b. Define 0, —2z, x + y, and Ax as in Example 2; 
these functions are continuous by elementary calculus. This example 
will be referred to as the vector space of continuous functions on {a,}]. 


4. Let U be the set of all polynomial functions, with complex co- 
efficients, defined on the closed interval [a,b]. Equality, zero, nega- 
tives, sums, and scalar multiples are defined as in Example 3. The 
sum of two polynomial functions, and a scalar multiple of a poly- 
nomial function, are themselves polynomial functions. This example 
will be referred to as the vector space of polynomial functions on {a,b}. 


5. Given a fixed positive integer n. Let U be the set of all poly- 
nomial functions, with complex coefficients, of degree <n, defined 
on the interval [a,b]. Sums and scalar multiples are defined as in 
Example 4. 


6. Let U be the set of all sequences = (Ax) of scalars, all of whose 
terms, from some index onward, are 0 (the particular index may vary 
from sequence to sequence). Write (Ax) = (ux) in case Ay = μα for 
all k = 1,2,3,---. Define @ = (0), —(Ax) = (—Ax), (x) + (ux) = 
(Xx + μι), and A(Ax) = (AAx). See the discussion in Example 2. This 
example will be referred to as the vector space of finitely non-zero se- 
quences. 


7. Let Ὁ be the set of all scalar-valued functions x defined on a set 
3, such that τί = Ὁ except for at most a finite number of points 7. 
Define 0, —z, x + y, and Az as in Example 2. One obtains Hxample 6 
by taking, in the place of 3, the set of all positive integers. If 3 con- 
tains only finitely many points, say n of them, one obtains Example 1 
(see the discussion in Example 2). This example will be referred to 
as the vector space of finitely non-zero functions on 3. 


8. The set Ὁ of all complex numbers u is a vector space, with μ -+ v 
and λμ defined as usual. This is essentially Example 1 with n = 1. 


"" 
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Exercises 
1. Fill in the details in the above examples. 


2. Given a vector space W, let Ὃ be the set of all functions x de- 
fined on a set 5, taking values in Ὁ. With suitable definitions, V is a 
vector space. 


3. Same as Hzercise 2, except that x has values ~@ for at most 
finitely many points ἐ. 


§2. FIRST PROPERTIES OF VECTOR SPACES 


Several immediate consequences of the axioms for a vector space 
are developed in this section. 
Theorem 1. /n any vector space: 


(1) The vector equation x + y = z has one and only one solution x, 
the vectors y and z being given in advance. 


(2) Ifz+2 =z, necessarily z = 6. 

(3) AO = 6, for every scalar i. 

(4) Ox = θ, for every vector x. 

(5) If d\x = 6, then either Ἃ = Ὁ or z = θ. 
Proof. 


(1): Given vectors y and z. Set x = z+ (—y); one hasx+y = 
[2+ (-y) ty =2+[(-y) +4] =2z+6 =z. This proves that a 
solution exists. Suppose 2 and 2 are solutions: 2; + y = z = %2 + ἡ. 
Then, (αι + y) + (—y) = @e+y+(-y), a+ly+(-y] = 
ta + [y + (—y)], σι +6 = x2 + 0, τι = 2. Thus, only one solution 
exists. 

(2): Suppose z - z = z. Since also 6 + 2 = z, z = @ follows from 
the uniqueness of solutions. 


(3): λθ = λίθ + 6) = λθ + AO, hence λθ = @ by part (2). 
(4): 02 = (0 + 0) = Ox + Oz, hence Ox = 8. 


(5): Suppose Ax = θ, and A γέ 0. If μ is the reciprocal of A, @ = 
μθ = pz) = (μλ)Σ = la =z. ἢ 
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It follows that in axiom (A3), no other vector can play the role of 
8; that is, the distinguished vector @ is unique. Furthermore, given 
any vector z, the equation z + y = @ has exactly one solution y, 
namely y = —z; thus, the only vector that can fulfill axiom (A4) is 
—z. Briefly: the zero vector is unique, and each vector has a unique 
negative. 
Definition 1. Given two vectors x and y, the vector x + (—y) is called 
the difference of x and y, and 1s denoted x — y (read “x minus y’’). 


Thus, in part (1) of Theorem 1, the unique solution z of αὶ + y = z 
ist=z—y. 
Corollary. In any vector space, 

(6) (—A)z = A(—2z) = — (Az) 

(7) λᾷ --ν = de --ὰν 

(8) A— μὴ) = dr — pe. 
Proof. 


(6): One has 6 = Ox = [A + (—A)]a = Aw + (—A)z, hence (—A)z 
= —(\x) by the uniqueness of negatives. Similarly, 6 = \@ = 
A[z + (—2z)] = Ax + λί-- 1), hence A(—z) = —(Az). The reader can 
easily supply the proofs of (7) and (8). ἢ 


In particular, (—1)z = —z. 


Exercises 


1. In any vector space, —(—z)=2z, —0=0, —(t#+y)= 
—z—-y, «—-(y+2)=@-—y)—-z, -—@-y) ξν-- 5. 


§3. FINITE SUMS OF VECTORS 


Throughout this section, all vectors are taken from an arbitrary 
vector space Ὁ. 

If 21,22,%3 are vectors, one has 2 + (χα + %3) = (τι + 22) + 23 
by the associative law; one writes simply x; + x2 + 23. Strictly ac- 
cording to the axioms, vector sums are performed with two vectors 
at a time; the symbol 2, + 22 + zg is nevertheless unambiguous, 
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since the result is the same whichever + sign is given priority. An- 
other commonly used symbol for this vector is pave , te In general: 


Definition 1. If x1,-+-,t, are vectors (n > 1), the symbol ΣΝ 1 7 
denotes the vector defined inductively as follows: 


» LE = X% + Le 
pe = (en 4} Ἔ Ζ,, (n> 2). 
Other notations: x, +--+++- 2p, Pe Lk, SA Lk. 


More generally, if m < n, and vectors %p,-++,2, are given, Σ. Lp 
denotes the vector defined inductively by the formulas 


m-+1 
Za Le = Im + Im41 


Σ᾿. t= Gog ts) +2n, (n>m+ 1). 

It is also convenient to define Zz. Te = Xn, and 5 ed 2 = θ when 
m > n. For example, Ps ἄχ = 2g, and oe τὰ = 0. 

Given vectors 2; ,%2,%3,24, consider the vector y = 2 + 2 + Xe + 24. 
Officially this is [(x, + 22) + 23] + 24, the + signs being given 
priority from left to right. It follows from the associative law that all 
other ways of assigning priority produce the same vector y: 

(ay + 22) + (v3 + 24) = [(a, + 12) +23) +a = y 
σι + (@2 + %3)] + 24 = [(σἱ + 22) +23] Ἔα = y 
τὰ + [(%2 + 23) + 24] = [δι + (x2 + 23)] + oy = y 
δι + [to + (3 + 24)] = 2 + [(xe + 23) + 24] = ψ. 


This principle holds for a sum of n vectors: 
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Theorem 1. (Generalized associative law) Given vectors 11," " -,t,. Sup- 
pose Ὁ = M “πὶ <q {--- «η,-- <M, = π᾿ Define 


y= Le Uk 


--ἰ πὸ 
ny 
yj = as Tk 


Ti 
yr = aie Uke 


Then, = 1 Ξ ae Tk 


Proof. 

The proof is by complete induction on n. If n = 2, either (i) r = 2, 
in which case πὶ = 1 and ng = 2, or (ii) r - 1, in which case m4 = 2. 
In case (i), y1 = δι and ye = 22, thus >’) yj = Do, ae. In case (ii), 

—1 .2 
Y, = δὶ + 22, thus Ds “y= Σὰ Uk. 

Assume inductively that the conclusion holds whenever the number 
of given vectors ἂχ is <n. Suppose 2,---,%, given, n > 3. Let us 
consider three cases. 


Casel.r =1 : x : 

Necessarily πὶ = n, thus )), yj = 0, ¥s =¥1 = Do, % = Σ᾿, te. 

Assume, for the remainder of the proof, that r > 1. Then }), y; = 
ἘΠΕ Yj + Yr = p Bias xy + yy, by the inductive assumption. 

Case 2. ny_y =n -- ὀ 

r Tir nA— 

Then, y, = Σ᾿, ἄχ = tn, thus). yj= Do, tty = Σ te 
-- tn = Ἔ: Lhe 

Case 3. ny_y <n—1 

In this case, the sum y, = Die: 41 Uk has at least two terms, 
hence y, = r,s te + ἀρ. Then, > yj = Do, te + Yr = 


dy + (ar, te τ te) ¥ ὑπ᾿ ar ee 3} ἘΜ 
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the latter is equal, by the inductive assumption, to ia 3} + In 
ee 


The sum of n vectors is also independent of the order in which the 
terms are written. For example, t2 + ΧΩ + 2; = (tq + 23) + 2 = 
δὶ + (χὰ + 14) = (σι + Le) + 243 = αὶ + χὰ +23. In general: 


Theorem 2. (Generalized commutative law) Given vectors 21,-++-,2p. 
Let 1',---,n' be any rearrangement of the indices 1,-++,n, and set y;, = 
p(k = i *n). Then, 2A YE = Di Lp. 
Proof. 

For n = 1, the assertion is trivial. Assume the conclusion holds 
when the number of given vectors is <n. 

Case 1. Τῇ π' = n, then yn = 2p. Also, 1’,---,(m — 1)’ isa rearrange- 


ment of 1,---,n — 1. Then, a Yr = =" Yk + Yn = aa χὰ + 
x, by the inductive assumption. 

Case 2. If n’ <n, then n = m’ for some index m < n. Consider 
the rearrangement 1*,---,n* of 1,---,n defined as follows: m* = 
π΄, n* = m' = ἡ, and k* = k’ for all remaining k; in other words, 
in the arrangement 1’,---,n', interchange m’ and π΄. Set zp = 2p 
(k = 1,-+-,n); since n* = n, one has Do 2k = pba” by case 1. 
Using Theorem 1 and axiom (A1) at the appropriate steps, ἊΣ ἄχ = 


Dis = (ΣΙ ὁ + en) + (DNs a τω) 
(Σ΄ τ εξ +e) 

= (Stan + aw) + (DG et 2») 
( 


Sr et im) + ii γε + im) = Div ἢ 
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Exercises 


1. Show directly that a, + rg, + % = τὶ + 22 + x3 for every 
rearrangement 1’,2’,3’ of 1,2,3. [There are six arrangements, also 
called permutations.] 


2. If x, = @ for all k, then 2 & = 0. 
a: = (Σ: 5) = >) (-™). 
4, = ZE + pa Ye = poe (xe + yk). 


5. If 2, = x for all k, then ὩΣ, tt = nt. 


6. Given vectors 2;.(j7 = 1,--+,m; k = 1,-++,n). For each j, define 
yj = Do, tx, and for each k, define z, = >) εἶμ. Then = yy 5, Sh 
That is, 


Dd; (=, zn) =), (x rn), 


§4. LINEAR COMBINATIONS OF VECTORS 


In the vector space of n-ples (Hxample 1.1), consider a vector 
Ὁ = (Ay,°+*,An). Let εκ denote the vector whose k’th component is 1, all 
others 0. Then, x = (A;,0,---,0) + (0,A2,0,---,0) +---+ (0,-+-,0,A,) 
= Aye) + Agee +--++ An€n. This is an instance of the following: 


Definition 1. In any vector space, a vector x is said to be a linear com- 
bination of the vectors x,,--+-+,X,, with scalar coefficients \,,---,A,, in 


case x = pe Ag. 


For example, in the vector space of 2-ples, let z = (2,3), z,; = 
(1,0), ze = (0,1), vg = (1,1). Then, x = 2a, + 8125 + Oxg = Ox, + 
1χ. + 223 = (—I1)a, + Ore + 323 = lz, + 2% + 115, ete. . κι. 

A number of useful facts about linear combinations are collected in 
the following theorem. These are easily deduced from the results of 
the preceding section; the proofs are left to the reader: 
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Theorem 1. Jn any vector space: 
4) SO Nate + ΣΟῚ mete = DO) Oe + we) 


(2) λ (x 3) => Ma 

(3) (25 ss) a= De 

@) (tx) (Zia) = ΣΣ, (S7_, x2) 
= Di. (ER). 


(5) 6%s a linear combination of x1,-*+,2n. 
(6) « is a linear combination of x. 


(7) Given vectors 2,2,++*,2n. If m <n, and x ts a linear combina- 
tion of 21,°**,%m, then x 18 also a linear combination of 21,+ + * Xn. 

(8) Suppose x is a linear combination of 2,--*,tn; if 1',-++,n' ts 
any rearrangement of 1,---,n, and yy = τὰν, then x is a linear 
combination of Y1,°* + ;Yn- 

(9) If xis a linear combination of y1,- ++ im, and each y; is a linear 
combination of 21,°**,tn, then x is a linear combination of 
Δ." * yen. 


Exercises 

1. In the space of 4-ples, express = (A1,A2,Ag,A4) a8 a linear com- 
bination of «1 = (1,0,0,0), ze = (1,1,0,0), wg = (1,1,1,0), 24 = 
(1,1,1,1). 

2. In the space of 3-ples, can x = (1,0,1) be expressed as a linear 
combination of αι = (1,1,0), ze = (0,1,1), zg = (1,2,1)? 

3. If x and y are linear combinations of 2,---,2%,, so are x + y and 
Ax. 


4. In the space of m-ples, given vectors 2,2,,-+-,%,. The problem 
of expressing x as a linear combination of 2;,---,z, is equivalent to 
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the problem of solving a certain system of m linear equations in ἡ 
unknowns. This system is homogeneous if and only if ἃ = @. 


§5. LINEAR SUBSPACES, LINEAR DEPENDENCE 


Consider, in the vector space of n-ples, the totality of all vectors x 
whose first component is 0, that is, x = (0,A2,---,An). If x and y are 
two such vectors, so are x + y and Az. Evidently, this set of vectors is 
a vector space in its own right. This is a special case of the follow- 
ing: 


Definition 1. Let U be a vector space. A linear subspace of Ὃ is a set 
9 of vectors belonging to Ὃ, such that: (i) St contains the vector 8, (ii) if x 
and y are vectors in Il, the vector x + y 18 in N, and (111) of x 2s a vector 
in X, and d is any scalar, the vector \x 18 in N. 


Briefly, a linear subspace contains sums and scalar multiples of its 
vectors. If οἵ is a linear subspace of VU, and z is a vector in 2, the rela- 
tion —x = (—1)z shows that —z is in 9; since the vectors in 91 
satisfy all the identities in the axioms for Ὃ, clearly 9 is itself a 
vector space. 

The purpose of condition (i) is to ensure that 2 is not the empty 
set of vectors, that is, contains at least one vector z; 9 will then 
contain @ = ΟΣ by condition (iii). 


Examples 


1. In any vector space, the set 21 containing just the vector 9 is a 
linear subspace; this is shown by the relations 6 + 6 = 6 and λθ = θ. 


2. Ὃ is itself a linear subspace of the vector space Ὁ. 


3. If Ὃ is a vector space, and z is a fixed vector in VU, the set 91 of all 
scalar multiples uz of z is a linear subspace. This results from the rela- 
tions 0 = Oz, wz + vz = (u + v)z, and A(uz) = (λμ)ξ. 


4. Let Ὃ be the vector space of scalar-valued functions defined on a 
set 3 (Example 1.2). If to is a fixed point of 3, the set 9t of all functions 
x such that x(fo) = 0 is a linear subspace of Ὁ. 


5. Let Ὃ be the vector space of all scalar-valued functions z defined 
on the interval [a,b]. The following is a series of linear subspaces 
(getting smaller): 
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(i) στ. the set of functions x which are continuous at a fixed point 
ty of [a,b]. 


(ii) Ne the set of functions x which are continuous at every point 
of [a,b] (this is Lxample 1.3). 

(iii) Ng the set of all polynomial functions z with complex co- 
efficients (this is Example 1.4). 

(iv) 94 the set of all polynomial functions, with complex co- 
efficients, of degree <n, where n is a fixed positive integer (this is 
Example 1.5). 


6. Let Ὁ be the vector space of continuous functions z on [a,}] 
(Example 1.3). The set 9 of all x which are differentiable on [a,b] is a 
linear subspace. 


7. Let Ὃ be the set of all twice-differentiable functions x defined on 
[a,b], and 9 the set of those functions x which satisfy the differential 
equation x” +z = 0. Then, Ὃ is a vector space, and % is a linear 
subspace of Ὁ. 


Theorem 1. Jf 31 and XN are linear subspaces of U, the set of all vectors 
of the form y + z, with y in M and z in N, ts a linear subspace of V. 


Proof. 
θ- θ-ἘΘ, (yr +21) + Yo +22) = (Yi + Yo) + (δι + 22), and 
A(y - 2) = Ay + dz. 


Definition 2. The linear subspace described in Theorem 1 is called the 
sum of I and N, and is denoted NM + N. 


Examples of such sums will be found in the exercises. 


Lemma. Jf 91 is a linear subspace of V, and x1,-++,%» are vectors in M, 
then every linear combination of 21,--+,t, belongs to MM. 


Proof. 
Σ hers 18 the same vector, whether formed in the vector space Ὃ 
or in the vector space ©. ἢ 


Theorem 2. Let 8 be any set of vectors in a space Ὃ, and let N be the set 
of all vectors in Ὃ which can be expressed as a linear combination of 
vectors in 8. Then, 


(1) ts alinear subspace of Ὅ; 
(2) every vector of 8 belongs to XN. 
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Moreover, St is the smallest such linear subspace, in the following sense: 


(3) 1 511 ἐξ a linear subspace which contains every vector of 8, neces- 
sarily St contains every vector of I. 

Proof. 

(1): If 8 is the empty set of vectors (i.e. contains no vectors at all), 
the convention is that 9t consists of just the vector 0. 

Otherwise, if the vector z belongs to 8, St contains 6 via the relation 
θ = Oz. Suppose zx and y are vectors in 9, say x = a Aev, and 
y= pie ujyj, Where the 2; and y; are in 8. Then, x + y is a linear 
combination of the vectors 21,+ " *,2n,¥/1,°**;Ym Of 8 (see T'heorem 3.1), 


hence x+y belongs to St. Also, Ar = oA (λλεὴχχκ shows that Ax 
belongs to St. Thus, 91 is a linear subspace. 


(2): If zisin 8, z = 1z shows that z belongs to XN. 


(3): Suppose 91 is a linear subspace which contains every vector of 
8. Then, 9% contains every vector of It by the Lemma. ἢ 


Definition 3. Jf 8 is a set of vectors in the space Ὃ, the linear subspace 
described in Theorem 2 is denoted [5], and is called the linear subspace 
generated by 8. 


Evidently, [0] coincides with U. If 0 is the vector space of n-ples, 
and 8 is the set of vectors 61," " “ν6,ῳᾳ described in § 4, then [8] coincides 
with U since every vector in U can be expressed as a linear combina- 
tion of the ey. Thus, different sets of vectors may generate the same 
linear subspace. 


Definition 4. Lei 3t be a linear subspace of U. A set 8. of vectors is called 
ἃ system of generators for Ii in case the linear subspace [8] coincides 
with ὅπ. Briefly, ὃ generates It. 

Another way of expressing what is going on in Theorem 2 is as fol- 
lows: 


Definition 5. Let 8 be a set of vectors in a space VU. A vector x is said to 
be linearly dependent on 8 in case there exist vectors x1,°- +, in ὃ such 
that x is a linear combination of the τι. Briefly: x depends on 8. 


The convention is that only the vector θ depends on the empty set 
of vectors. Comparing Definitions 8 and δ: 


I 
if 
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Theorem 3. 18 is any set of vectors in the space Ὃ, then the linear sub- 
space [8] generated by 8 is precisely the set of all vectors which are linearly 
dependent on ὃ. 


If x depends on 8, evidently z depends on any set of vectors which 
includes 8. If 9t is a linear subspace, every vector dependent on St 
actually belongs to Xt. 


Exercises 


1. Let Ὃ be the vector space of n-ples, and let 1,-+-+,a_ be fixed 


scalars. The set 9 of all vectors x = (Ay,°**,An) such that VBS QAR = 
0 is a linear subspace of Ὁ. 


2. Let U be the vector space of n-ples, and let (α;κ) be a system of 
fixed scalars (j = 1,---,m; k = 1,---,n). The set ὃ1 of all vectors 
ᾧ = (Ay,‘**,An) such that Ae ajax = 0 for 7 = 1,---,m, is 8 
linear subspace of Ὅ. [It is the set of all “solutions of a system of 
m homogeneous linear equations in πὶ unknowns.’’} 


3. Let Ὁ be the vector space of scalar-valued functions x defined 
on a set 5 (Example 1.2). The following are linear subspaces of Ὅ: 

(i) 9 the set of all finitely non-zero functions z, described in Ex- 
ample 1.7. 

(ii) Fix a set $ of points of 3, and let 2 be the set of all functions z 
which are 0 at every point of 8. 


4, If 9 and % are linear subspaces of Ὃ, the set of vectors common 
to 9% and 2% is a linear subspace of Ὁ. [This subspace is called the 
intersection of I and I.] 


5. Let U be the vector space of 3-ples. 

(i) If st is the set of all vectors y = (A,0,0), and is the set of all 
vectors z= (u,u,0), describe SM + 51. 

(ii) If 91 is the set of all vectors y = (a,8,0), and 91 is the set of all 
vectors z = (0,\,u), describe IN + 2. 


6. Let U be the vector space of scalar-valued functions x defined 
on the symmetric interval [—a,a]. Call x even if x(—it) = χ(ἢ for all 
t, and odd if x(—t) = —<x(t) for all t. The set 9% of all even functions 
is a linear subspace of Ὁ; so is the set 2 of all odd functions. Describe 
ὅπ -- I. 
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7. Let 3, 91 be linear subspaces of VU, and suppose @ is the only vec- 
tor common to 9% and 9. Then, the representation of a vector x of 
9 -+- Nin the form z = y + z, where y isin M and zis in MN, is unique. 
That is, if μι +2: = ye + Ze, necessarily y; = yo and 2; = Zo. 
Apply this result to zercise 6. 


8. Let 511 and N be linear subspaces of VU, and let 8 be the set of all 
vectors x which belong either to SW or to XN (or to both). Then, [8] is 
the linear subspace ὅλ + 91. 


9. Let Ὁ be the vector space of n-ples, $ the set of vectors x, = 
(1,0, - ὺ -,0), t2 = (1,1,0,- . +50); "> tn = (1,1, : -,l). Describe [8]. 


10. Let Ὁ be the vector space of polynomial functions on [a,b] 
(Example 1.4), and let 8 be the set of polynomials 1,t,t?,---,é". De- 
scribe [S]. 


11. Let U be the vector space of twice-differentiable functions on 
{a,b}, and let 8 be the set consisting of the two functions sin ¢, cos t. 
Then [8] is the vector space of solutions of the differential equation 
z’+2=0. 


12, If 1 is a linear subspace of Ὃ, then [51] is precisely 9. 


13. ὃ and 3 generate the same linear subspace 91 if and only if: 
every vector of 8 is a linear combination of vectors in 3, and vice 
versa. 


14. In the vector space Ὃ of n-ples, the following sets $ are systems 
of generators for Ὁ: 
(i) 8 consists of the n vectors e; = (1,0,---,0),-:-, en = (0,-> -,0,1). 
Gi) 8 consists of the πὶ vectors σι = (1,0,---,0), z2 = (1,1,0,---,0), 
τὴ ey = (1,1, ; sgh} 
| (ili) 8 consists of 2,---,t,, as in (ii), together with z= 
(1,2,3,0, na 0) and y (—1,0,5,0, ἐδ 0). 


δ 6. LINEAR INDEPENDENCE 


In the space of 3-ples, the vector e; = (0,0,1) cannot be expressed 
as a linear combination of the vectors δι = (1,0,0) and es. = (0,1,0); 
for, any linear combination \,¢, + doeo necessarily has 0 for its third 
component. One expresses this inexpressibility by saying that e; is 
linearly independent of 61,62. In general: 
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Definition 1. Let 8 be a set of vectors in the space VU. A vector x is said 
to be linearly independent of 8 if it cannot be expressed as a linear com- 
bination of vectors in 8. Briefly: x is independent of 8. 


Thus, x is independent of $ precisely when it does not depend on ὃ 
in the sense of Definition 5.5. Clearly x is independent of 8 precisely 
when it does not belong to the linear subspace [8] generated by 8; 
in particular, x ~ 6. The convention is that every non-zero vector is 
independent of the empty set of vectors. If z is independent of §, 
it is also independent of every set of vectors which is included in 8. 
If X is a linear subspace of Ὁ, the vectors independent of Xi are pre- 
cisely those which are excluded by 2. 

Consider, in the space of 3-ples, the vectors e,¢2,¢3 discussed earlier. 
It was noted that e3 is independent of e;,éo. It is equally clear that 60 
is independent of 61,65, and δι is independent of 85,64. Briefly, one says 
that ¢),€0,é3 are independent. In general: 


Definition 2. A set S of vectors is said to be linearly independent 77 case: 
each vector in 8 is independent of the remaining vectors of 8; that 18, no 
vector of 8 can be expressed as a linear combination of other vectors of ὃ. 
Briefly: 8. is an independent set of vectors. 


The convention is that the empty set of vectors is independent; 
otherwise, independent sets contain only non-zero vectors. The set ὃ 
consisting of a single non-zero vector x is independent. 

It is clear from Definition 2 that a set 8 is independent, if and only 
if every finite set of vectors 711." " "γῶν in 8 is independent. In practice, 
it is useful to have several reformulations of the condition for the in- 
dependence of 21,°**,%n: 

Theorem 1. The following conditions on the set of vectors x,- ++ ,X, imply 
one another: 


(a) 2,°°*,% are independent. 
(b) xy ts independent of x1,-+-,te-1, for k = 1,-+-,n. 
(ὁ) If ΣΟ dete = 0, necessarily λα = Ὁ for all k. 


(4) If DP date = Dy mere, necessarily dx = ux for all ke. 


Proof. 
The equivalence of these four conditions will be established by prov- 
ing the following propositions: (a) implies (Ὁ), (Ὁ) implies (6), (6) im- 
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plies (d), and (d) implies (a). The assertion in (b) for = 1 means 
simply that 2; γέ 6, in view of the conventions about the empty set 
of vectors. 


(a) implies (Ὁ): This is clear from Definition 2. 


(b) implies (c): Assume to the contrary that a relation >. Ane, = 8 
exists in which not all the d; are 0. If r is the largest subscript for 
which ἃ, ~ 0, then bie Ae = pas λχῦχκ = 6. Since 2, + 8 by the 
hypothesis (b), necessarily r > 1 (see part (5) of Theorem 2.1 ). Then, 
t, = ας (--ἀὰ}Ρ Ax) 2; but this contradicts (b). 


(c) implies (d): If 5 dee Aete = >,” mete, then τ: (At — με) = θ, 
hence λκ — μὰ = ὃ for all k, by the hypothesis (c). 

(d) implies (a): Assume to the contrary that for some index 7, 2; 
can be expressed as a linear combination of the remaining <x, say 
4) ΞΞ ΣΎΝ Ante. Defining A; = —1, one has Σ, het, = θ. Since also 
= pe Ozz, the hypothesis (4) yields \; = 0 for all k; this is absurd 
forrk=j. ἢ 


Condition (d) can be interpreted as follows: when a vector is ex- 
pressible as a linear combination of independent vectors 2j,-+-,n, 
the coefficients are unique. 

Suppose now that % is the linear subspace generated by a set ὃ 
of vectors. This means that 9 is the set of all vectors which can be 
expressed as a linear combination of vectors in 8. At times, unique 
expressibility may be desirable; this requires not merely a system of 
generators for 9, but an independent system of generators. It can be 
proved that every system of generators ὃ includes an independent 
system of generators 3; so to speak, one obtains 3 by suppressing cer- 
tain vectors of $, each suppressed vector being one which is still ex- 
pressible as a linear combination of the vectors of ὃ which remain. 
The translation of this idea into a formal proof requires some form of 
‘“transfinite induction”; however, for our purposes, the following 
special case will suffice: 


Theorem 2. Suppose the linear subspace I of Ὃ is generated by a 
sequence (finite or infinite) of vectors 21,%2,¥3," ++. Then, there exists a 
SUbseQUENCE LkyyTkqyLky*** Which (i) generates N, and (ii) ts linearly 
independent. 
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Proof. (after a fashion) 

If all the δὲ are 0, one takes the “empty subsequence” having no 
terms. Otherwise, let 2;, be the first non-zero ἀρ; then, for k < Κι, 
ry, = θ = Oxy. 

If all the x; for k > k; are scalar multiples of 2,,, the proof halts. 
Otherwise, let x;,, be the first 2, which is not a multiple of T,; then, 
for k < ko, zz is a multiple of 2;,,. 

If all the x, for k > ke are expressible as linear combinations of 
L,,Ck,, the proof halts. Otherwise, let x,, be the first 2; which is in- 
dependent of 2;,,7%,; then, for k < kz, zg is a linear combination of 
Lk, keg: 

The proof continues (or halts) inductively. Since k, < ke < kg < 
ἐλ the x, are a subsequence of the χε. Given any vector z in 9, say 
t= p Si λεῖκ. Since each of 2,-*+,%, is a linear combination of 
Lk," **,Lk,,, provided m is taken large enough, z is itself a linear com- 
bination of 2,,,-+-,2x,,; thus, the subsequence tk; generates 51, By 
construction, each zz, is independent of z;,,-++,v__,, hence the sub- 
sequence 2, is independent by Theorem 1. 


n—l? 


Exercises 


1, In the vector space of n-ples, the following vectors are inde- 
pendent: 2; = (1,0,--+,0), 15 = (1,1,0,---,0),-+-, ἄν = (1,1,---,1). 


2. In the vector space of finitely non-zero sequences (Hxample 
1.6), let x, be the sequence whose first & terms are =1, all other terms 
=Q. Every vector is uniquely expressible as a linear combination of 


the 2. 


3. In the vector space of continuous functions on [a,b], a < b: 

(i) The functions sint, cost are independent. 

(ii) The functions sin ἐ, sin cl are independent, provided οἷ ¥ 1. 

(iii) If z is any polynomial function, the functions z(é),e', sin ¢ are 
independent. 


4. In the vector space of polynomial functions on [a,b], where 
a <b, the functions 1,t,é?,---,¢" are independent. 


5. In the vector space of 2-ples: 
(i) Vectors z= (a,8), Ἧ ἘΞ (7,9) are independent if and only if 
ad — By ~ 0. 
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(ii) If x,y are independent, every vector z can be expressed as 8 
linear combination of z,y. | 
(iii) It is not possible to have three independent vectors. 


§7. BASIS, DIMENSION 


The problem of finding an independent generating system is easily 
solved for the following class of vector spaces: 


Definition 1. A vector space Ὃ is said to be finitely generated if there 
exists a finite set of vectors x,,+++,t, which generate Ὃ in the sense of 
Definition 6.4. 


Definition 2. A set ὅ of vectors is said to be a basis for a vector space Ὃ 
in case (i) 3 generates Ὅ, and (ii) 5 7s tndependent. 


Theorem 1. Every finitely generated vector space has a basis. 


Proof. 

By assumption, Ὃ has a finite system of generators 2,-*-,%,; sup- 
pressing x’s as necessary, we may suppose by Theorem 6.2 that the 
χὰ are also independent. Then, the zz are a basis. _ 


Examples 


1. In the vector space of n-ples, the vectors εἰ = (1,0,---,0), 
eg = (0,1,0,---,0),---, én = (0,---,0,1) are a basis. In the sequel, this 
will be referred to as the canonical basis for the space of n-ples. 


2. In the vector space of finitely non-zero sequences (Hxample 
1.6), the vectors 61 = (1,0,---), 65 = (0,1,0,---), e3 = (0,0,1,0,-- ); 

- are a basis, called the canonical basis. This vector space is not 
finitely generated. For, if 21,-++,%, is any finite set of vectors, there is 
a fixed index N such that the k’th component of any linear combina- 
tion of 2,--+,%, is Ὁ for allk = N. 


3. In the vector space of polynomial functions on [a,b] (Example 
1.4), where a < b, the functions 1,é,t°,--- are a basis (see Exercise 
6.4). This space is not finitely generated, since the lmear combina- 
tions of a fixed finite set of polynomials are of bounded degree. J 

If U is a finitely generated vector space, the number of vectors in 
any basis is the same; this is a consequence of the following theorem, 
known as Steinitz’s exchange theorem: 
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Theorem 2. Suppose 11." "πῃ is a system of generators for the vector 
space VU. Let y1,---,Ym be linearly independent vectors, with m <n. 
Then, there exists a rearrangement 1',---,n' of 1,---,n, such that the 
vectors Y1,°**;Ymy L(m+1)'y***, en generate VU. In other words, afier a 
suitable rearrangement, the first m x’s may be replaced by the y’s, in such 
a way that the resulting set of vectors generates V. 

Proof. (by induction on m) 

If m = 1, one is simply assuming y, τέ 6. Suppose y; = rae ALTE. 
Since ψι γέ 0, not every A; can be 0. Rearranging the 2’s, let us sup- 
pose A; ~ 0; then, one can solve for 2, as a linear combination of 
Y1,T2,*** ln, SAY 

(*) Dy = μὲμι + + ϑϑῶ μκῖκ. 

Tt will be shown that the vectors y1,%2,-++,@, generate Ὃ, Let x be 
any vector of Ὁ, and suppose z = 2 vytp. Replacing x; by the 
formula (*), x is expressed as a linear combination of ψ1,1.," «+ tn. 

Assume the theorem true for m — 1, and let independent vectors 
Yi,°**;Ym be given. The vectors y,--++,%m— are also independent; 
rearranging the z’s, we may suppose, by the inductive assumption, 
that the vectors ¥1,°**,Ym—1,Um,***,tn generate V. In particular, y,, = 
--“1η|- i * Β ‘ ‘ os, 
ει AiYi + pe μμῖδμγ for suitable coefficients. Since y, is inde- 
pendent of 4, " -,Ym—1, not every ux, can be 0. Rearranging x, - --,2n, 
we may suppose y,, ~ 0. Then, z,, can be solved for as a linear com- 
bination of ¥1,°--,Y%m—1;5Ym;m+41,°**,Tnj the concluding details are 
left to the reader. 


Corollary 1. /f Ὁ is generated by 115" ++, no independent set can con- 
iain more than n vectors. 


Proof. 

Assume to the contrary that y;,- “ “νην πα are independent vectors. 
Applying Theorem 2 to 21,+--,%, and ¥,--*,Yn, we conclude that 
Y1;°**;Yn generate Ὃ. In particular, y,4, depends on y;,:++,yn, a 
contradiction. J 


Corollary 2. If 2,-+-,%, and ;,---,Ym are bases for V, necessarily 
m= Nn. 


Proof. 
Since £1,*++,% generate U, and y1,+++,Ym are independent, m <n 
by Corollary 1; reversing the role of x’s and y’s,n<m. ἢ 
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Definition 3. Jf U is a finitely generated vector space, the (unique) num- 
ber of vectors in any basis is called the dimension of U; in this case, U 
is said to be finite-dimensional. If U is not finitely generated, it 18 said 
io be infinite-dimensional. 


As remarked in § 6, it can be shown that every vector space has a 
basis (what is needed, of course, is a proof for the infinite-dimensional 
case). For our purposes, the following aspect of infinite-dimensionality 
will suffice: 


Theorem 3. In any infinite-dimensional vector space VU, there exists an 
infinite sequence of linearly independent vectors x1,%9,%3,** +. 


Proof. 

Let x, be any non-zero vector; let x2 be any vector independent of 
a; let x3 be any vector independent of 21,42. Inductively, let rp+, 
be any vector independent of 2,---,%,; such a vector must exist, 
otherwise 2,,-**,%, would generate U. The x; are independent by 
criterion (b) in Theorem 6.1. ἢ 


Exercises 


1. In the vector spaee of n-ples, the vectors 2, = (1,0,---,0), 
%o = (1,1,0,---,0),--+, a = (1,1,-++,1) are a basis, 


2. In the vector space of finitely non-zero sequences, the vectors 
1 = (1,0,0, - F *), 2 = (1,1,0, - ' *), 3 = (1,1,1,0,- r *);° *+ are a basis. 


3. Let Ὃ be a vector space of finite dimension n. Then: 

(i) Every independent set of n vectors is generating. 

(ii) Every generating set of ἢ vectors is independent. 

(iii) If 9 is a linear subspace of VU, then 91 has finite dimension m, 
with m <n; any basis for 9% can be augmented to a basis of VU; 
m = nif and only if 91 is all of Ὁ. 


4. Let Ὃ be the vector space of finitely non-zero functions x defined 
on a set 3 (Hzample 1.7). For each point 8 of 3, let x, denote that func- 
tion whose value at sis 1, and which vanishes at every other point of 3. 
Then, the vectors x, form a basis for Ὅ. 


5. If U is generated by a sequence of vectors #1,%2,73,- ++, show that 
Ὃ has a basis. 


"Ὁ. Every vector space has a basis. 


24 = Introduction to Hilbert Space Ι §8 


§8. CODA 


On looking over the foregoing definitions and theorems, it will be 
apparent to the reader that no use has been made of any particularly 
characteristic property of the system of complex scalars (such as the 
“fundamental theorem of algebra,’’ real and imaginary parts, complex 
conjugation). The definitions and theorems make sense, and are 
valid, for other systems of scalars; for example, real scalars, or rational 
scalars, or Gaussian-rational scalars (complex numbers of the form 
α -+ 78, where a and § are rational). More generally, the scalars may 
be drawn from an algebraic system known as a field (see any book on 
abstract algebra). Thus, one speaks of complex vector spaces, real 
vector spaces, and so on, in conformity with the system of scalars which 
is employed. 

In the sequel, only complex vector spaces will be considered, and the 
terms vector space and complex vector space will be used inter- 
changeably. 


Exercises 


1. (Ὁ Every complex vector space can be regarded also as a real 
vector space. 

(ii) The complex numbers form a one-dimensional complex vector 
space (Zxample 1.8), and a two-dimensional real vector space. 

(iii) If Ὃ is an n-dimensional complex vector space, it is a 2n-di- 
mensional real vector space. 


*2. View the real numbers as a rational vector space in the obvious 
way. Then 1, +/2 are independent. So are 1, π. So are 1, +/2, =. 
What is the dimension? 
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§ 1. PRE-HILBERT SPACES 


The conjugate of a complex number \ will be denoted \*. Thus, if 
\ = a +78, where a and βὶ are real numbers, then A* = a — 78. 
The familiar properties of conjugation are as follows: (A*)* = A, 
(A + w)* = A* + u*, (Au)* = A*u*, [A] = ν᾿ λ᾽λ, and A* = α if and 
only if d is real. 


Definition 1. A pre-Hilbert space is a complex vector space ©. For each 
pair of vectors x,y of @, there is determined a complex number called the 
scalar product of x and y, denoted (x|y). Scalar products are assumed 
to obey these rules: 


(P1) (y|z) = (el\y)* 
(P2) (ὦ - ν]2) = ([2) + (la) 
(Ρ8) (λα! νὴ) = Aly) 
(P4) (|x) >0 when 2 568. 


A convenient verbalization of (x|y) is “x scalar y.’”’ The reason 
for the term pre-Hilbert space is that one can pass, by the procedure 
25 
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of “completion,” from a pre-Hilbert space to a Hilbert space (see 
Theorem V.2.1). 
Examples 

1. Let Φ be the vector space of n-ples (Example I.1.1). If + = 
(A1,**+,An) and y = (u1,-*+,4n), define 

(x|y) = = Apen™. 

The axioms (P1)—(P4) are easily verified. This example is known as 
n-dimensional unitary space, and will be denoted ©”. 


2. Let ® be the vector space of continuous functions on [a,b] 
(Example 1.1.8), where a < b. Define 


b 
(ely) =f aOy(o* at 
This example will be referred to as the pre-Hilbert space of continuous 
functions on [a,b]. 
3. Let ® be the vector space of finitely non-zero sequences (Ex- 
ample 1.1.6). lf x = (Ax) and y = (u,), define 
(z|y) = DOr deue*. 


Since this is essentially a finite sum, convergence is not an issue here. 
This example will be referred to as the pre-Hilbert space of finitely non- 
zero sequences. 


4. If @ is any pre-Hilbert space, and 91 is a linear subspace of Φ, 
obviously 91 is itself a pre-Hilbert space. 
Exercises 


1. Fill in the details in the above examples (especially Example 3, 
for which the verification of (P4) is non-trivial). 


2. The vector space © of complex numbers (Example 1.1.8) is a 
pre-Hilbert space, via (A|u) = Au*. [This is essentially the unitary 
space Θ᾽ of Example 1. 
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3. Let ® be the vector space of finitely non-zero functions defined 
on a set 3 (Hxample I.1.7). Define 


(z|y) = Do, Oy", 
the sum being extended over all ¢ in 5. Then, is a pre-Hilbert space. 


4. Let ® be any vector space of finite dimension n, and 2,--+,2_ 
any basis of Φ, If x = τ Apt, and y = ΤῊ μιῖῦκ, define (1 9) = 
=. Anue”. Then, Ὁ is a pre-Hilbert space. 


§2. FIRST PROPERTIES OF PRE-HILBERT SPACES 


Axioms (P2) and (P38) for a pre-Hilbert space can be expressed as 
follows: the scalar product (x|y) is “additive” and “homogeneous” 
in the first factor. The first two statements of Theorem 1 assert that 
(x|y) is “additive” and “conjugate-homogeneous” in the second ἴβο- 
tor: 


Theorem 1. In any pre-Hilberi space: 

(1) @y+z) = @y) + (2) 

(2) (αἰ λῳ) = A*(z|y) 

(3) (@ly) = (:[8) =0 

(4) (ὦ -- ν]2) = @lz) — (lz) 

(αἰν — δ) = (zly) — (a2) 

(5) Jf (x\z) = (yz) for all z, necessarily x = y. 

Proof. 


(1): Using axioms (P1) and (P2), (ἰψ -Ἐ 2) = (y+e|z)* = 
L(y|x) + (|x)]* = (ν| 1)" + @|z)* = (εἰν) + (2). 

(2): Using axioms (P1) and (P3), (x|Ay) = (Ay|z)* = [A(y|a)]* = 
A*(y|x)* = A* (aly). 

(3): @|y) = @+64ly) = Oly) + @ly), hence (|y) =0. Simi- 
larly, (4 [0) = 0. 

(4): (@ — ylz) = @ + (—y)l2) = (2) + (—ylz) = (2) + 


((—)yl|z) = @l2) + (-- ἡ 6) = (αἰ) — (ψ[). Similarly for the 
second relation. 
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(5): Suppose (x|z) = (y|z) for all z. Then, (x — y|z) = (5 [2) — 
(y|z) = 0 for all z; in particular, (zx — y|z — y) = 0, hencex — y = @ 
by axiom (P4). J 


Exercises 
ΝΣ any pre-Hilbert space, 
(oy Atel y) = DO) λεία! ν) 
(w| Dy mivs) = Σὰ mil ys) 
(Do, Ante] Do, μήν = DR CLD 


§3. THE NORM OF A VECTOR 


Definition 1. In a pre-Hilbert space, the norm (or “length”) of a vector 
x, denoted || x ||, is the non-negative real number defined by the formula 


| z || = ν (]}}2). 

Suggested verbalization of || x ||: “norm a.” 
Theorem 1. Jn a pre-Hilbert space: 

(1) | Ae] = [Al el. 

(2) [2 || > O when 1 γέ 6; || «|| = O7¢f and only tf x = 8. 
Proof. 

(1): |] Aw ||? = (λα λα) = W*(e|z) = |Al? |] α [5. 


(2): This is immediate from axiom (P4), and the relation (6|6) = 
0. [ 


In particular, || —z || = || z || and || cx || = |||]. If « #6, the 
vector || z ||~* x has norm 1. 


Examples 


1. In the unitary space Θ᾽ (Hxample 1.1), if x = (Ag), 
Μ 


121 = (ΣΡ) 
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2. In the pre-Hilbert space of continuous functions on [a,b] (#2- 


ample 1.2), 
| x || = (f i208 a) - 


3. In the pre-Hilbert space of finitely non-zero sequences (Hxample 
1.3), if t= (Ax), 


τά 
151 - (ΣρινΡ)}.. ἢ 
The additivity of the scalar product yields an identity expressible 
simply in terms of norms: 
Theorem 2. (Parallelogram law) Jn a pre-Hilbert space, 


[5 Ἐν [[5.-Ὁ {5 -- ν [3 Ξ 21} 2)? +2] yl). 

Proof. | | 
One has || 2 + y |? = @+y|z+y) = Gla) + (z|y) + (y|z) + 
(ly) = zl? + ly’ + (αν) + G2). Replacing y by —y, 
5 --τν "ΚΞ [2 5" ἘΊ ν 15 -— @ly—- Ge. ἢ 

The norm of a vector is expressed, by definition, in terms of the 
scalar product. There is a useful formula which expresses the scalar 
product in terms of norms: 


Theorem 3. (Polarization identity) In a pre-Hilbert space, (x|y) = 
a{ii¢+ylP—lle-yP+clo+ yl? τὸ [α — ἐν {5}. 


Proof. 
In the identity 


(8) Jjety|P=|cl?+lly I? + @ly + Gla), 
replace y by —y, zy, and —tiy: 
Ic—yl? = [115 +iylP — ely) — ola) 
| e+ zy ||? = lla ll? + lly ll? — e@ly) + ey|2) 


|x — zy |? = [5 [5 + yl? + ἐδ! ν) — 2). 
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It follows that 
(b) “--ἰ ὁ -- ψ ||" Ξ -- [2 1" -- {ν 5 ἘΞ @ly) + Giz) 
() ἐ{|{π τ ἡ [5 Ξ ὁ} cl? +2llyl?+@ly -- gla 
(ἃ) --ἰ  } «Σ -- ἐν |? = --ἶ"} 1 [3 -- ἐ]} yl? + @ly) — Gla. 
Adding (α)--(ἃ}, the right hand side reduces to 4(x|y). J 


From the definition of norm, (|x) = || || || 2]. In general, 
|(x|y)| is dominated by the product of || || and || y ||: 


Theorem 4. (Cauchy-Schwarz inequality) In a pre-Hilbert space, 
(zly)|<il<ll iy. | 


Proof, 
If x = 6 or y = 6, then (x|y) = 0, and the conclusion is clear. 
Suppose, for instance, that y ~ 6. Dividing through the desired 
inequality by || y ||, the problem is to show that |(x|z)| < || z || when 
|| z || = 1. For every complex number 4, 


|| 2 — dz ||? = |] « |]? — δ" 12) — Al 4) + Ad* || z ||? 
= || «|? — αἰ )λ" — A@|z)* + λὰλ" 
= || «|? — @lz)(e|z)* 
+ (x|z)(z|z)* — (x|z)A* — A(x] z)* + AA* 
= || x ll? — |@lz)? + [@lz) — All@lz) — Δ] 
= || =|? — |@le)? + |@lz) — AP. 
In particular, for λ = («|z),0 < || © — doz ||? = || ||? — [( [2}}5. } 
Applying Theorem 4 in the unitary space C” (see Example 1), 
Corollary. If \1,°-*,An and ,°**,4n are complex numbers, 


| Dd, Anen* | < (sti e) (Stim 2). 


The Cauchy-Schwarz inequality leads to an inequality involving 
only norms: 


Theorem 5. (Triangle inequality) In a pre-Hilbert space, one has 
le+ylsiel+iyi. 
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Proof. 

Denote by p{A} the real part of the complex number ἃ; obviously 
|e{A}|<|A|. Applying the Cauchy-Schwarz inequality at the ap- 
propriate step, || « + y ||? = || ς [15 + || yl? + @ly) + @ly)* = 
Il 2 ll? + lly I]? + 2 {(ely)} < lla |? + {{ν Il? + 2|(Σ|Ψ}} < {τ IP 
+ilyl?+2cilvgl=(cl+iyb* Ὁ 


Consider, for example, the unitary space @”. A vector « = (a,a9) 
with real components may be interpreted, in the Cartesian plane, as 
the “arrow” from the origin (0,0) to the point (a,,a2). Then, || 2 || is 
the familiar formula for the distance from (0,0) to (a,a2). If also y = 
(8;,82), with real components, x + y represents the resolvent of the 
arrows x and y: 


(β.: β.) 


(α: Ἔβι, α3 ἘΒ9) 
(--βι. 8.) 


(α1; α 5) 


(α,--β., Q»o —B.s) 


The triangle inequality and parallelogram law can now be given the 
obvious geometrical interpretations from which their name is drawn. 

It is customary to carry over this geometrical language to pre- 
Hilbert spaces of arbitrary dimension: 


Definition 2. In a pre-Hilbert space, || x — y|| is called the distance 
from x to y. 


Certain properties of the norm have natural formulations in terms 
of distance: 
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Theorem 6. In a pre-Hilbert space, 
(1) [15 --’ ν "" 20; [[α -- ψ } =0 ¥ and only f c= ν. 
2) [15 -- ν]}" Ξε [ν -- ἁίὶ 
(8) 1} --, Σ]} “ [{5 --τν [ἘΠ ν -- 2]. 
Proof. 
presen (2) are clear from Theorem 1, and the relation y — ἃ = 


(3) results from the relation x — z= (x — y) + (y —2z), and 
Theorem ὃ. ἢ} 


Exercises 


1. If x and y are continuous complex-valued functions on [a,b], 
then 


| ὃ ab 
P< f le@ra- [ναι 


ὃ 
f a(t)y(t)* dt 


and 


(f ‘ln Ἐνῷ! it) < ( ‘lz? at) + (f ‘vor at) 


2. Let x and y be vectors in a pre-Hilbert space. If x = θ or y = 8, 
the Cauchy-Schwarz inequality reduces to 0 = 0. Assuming 2 + 0 
and y τέ 6, show that |(«|y)| = || x]! || y || if and only if z and y are 
“proportional” (i.e. y = Ax for suitable X). 


3. In a pre-Hilbert space, | || «|| — |] y || |< [{ --- ψ |!. 


4, Let x and y be non-zero vectors in a pre-Hilbert space. The rela- 
tion || z+ y|| = || z |] + || y || holds if and only if y = az for some 
real number a > 0. 


5. Let x,y,z be vectors in a pre-Hilbert space. The relation || 2 — z || 
= ||2 — y|| + || y — z|| holds if and only if there exists a real num- 
ber a, O< a <1, such that y = αὐ + (1 — a)z. 


6. Let x and y be non-zero vectors in a pre-Hilbert space. The rela- 
tion || x —y|| = | ||] — || y |] | holds if and only if y = ex for 
some real number a > 0. 


Ι͂ §4 Hilbert Spaces 33 
7. In view of Exercises 4 and 6, one has || x Ἐν || = | x || a ll y | 
if and only if || x — y || = | || z || — || y || |. This is also an immediate 


consequence of the parallelogram law. 


8. Deduce another proof of the Cauchy-Schwarz inequality, using 
the polarization identity and the parallelogram law. 


§4. METRIC SPACES 


At the close of the preceding section, a notion of distance between 

vectors in a pre-Hilbert space was introduced. The role of this concept 
is clarified by abstracting certain essentials: 
Definition 1. A metric space zs a sel X, composed of objects called the 
points of the space. It is assumed thal ὃ. is non-empty (that is, contains 
at least one point). For each pair of points x and y of the space, there ts 
determined a non-negative real number d(x,y), called the distance from 
x to y, subject to the following axioms: 


(M1) d(ay)>O0 when ry; 
d(z,y) = 0 <wandonlyif r=y 
(M2) d(x,y) = d(y,x) 
(M3) d(z,z) < d(x,y) + d(y,2). 
In words, distance is strictly positive, symmetric, and satisfies the 
triangle inequality. 


Examples 


1. Every pre-Hilbert space @ is a metric space, with d(z,y) = 
l|a — y || (see Theorem 3.6). 


2. A non-empty set % of complex numbers is a metric space, with 
d(d,u) = | Ts al. 


3. Let X be any non-empty set. Define d(z,y) = 1 when ἃ ¥ y, and 
d(x,z) = Ὁ for all x. It is easy to see that X is a metric space; such 
metric spaces are called discrete. 

4. If αὐ is a metric space, and $ is a non-empty set of points of 8, 
then § is itself a metric space (distances in 8 being measured as they 
already are in 80). 
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5. Let ἃ be the vector space of continuous functions defined on [a,}] 
(Example 1.1.8), where a < b. Define d(z,y) = LUB {|2(t) — y(®)|: 
a<t< b}; that is, d(z,y) 1s the least upper bound of the numbers 
|a(é) — y(t)|, as ¢ varies over the interval [a,b]. Then, X is a metric 
space. [Note: a continuous function on a closed interval zs bounded.] J 


Certain standard notions from the calculus of real numbers have 
natural generalizations to metric spaces: 


1. A sequence of points x, in a metric space is said to converge to the 
point x in case d(r,,r) — Ὁ asn — οὐ, This means: given any number 
e > O, there is an index N such that d(z,,2) < « whenever n > N. 
The point x is then unique; for, if also d(zn;,y) — 0, then z = y re- 
sults from 0 < d(z,y) < d(z,a,) + d(tn,y) τὸ 0+0. The point z 
is called the limit of the sequence ἄρ. Notations: t, — ἃ, or tn τὸ ὦ 
as ἢ — ©, ΟΥ ἃ = lim @,, etc. ... 


2. A sequence x, is said to be convergent if there exists a point x 
such that z, — zx. Otherwise, the sequence is said to be divergent. 


3. A sequence x, is said to be Cauchy in case d(2m,t%,) — Ὁ as 
m,n —» ©, This means: given any number e > 0, there is an index Ν᾽ 
such that d(%m,%,) < ε whenever m,n > N. Every convergent se- 
quence is Cauchy; for, if x, — x, then d(%m,%). < d(@m,v) + d(x,tp) 
- 0+ 0 as mn — ~. Not every Cauchy sequence is convergent, 
as is shown in the following example (more sophisticated examples 
are given in Examples 7 and 8): 


Example 6. Let X be the open interval (0,1) with d(a,8) =|a — B|. 
Let a, = 1/n (n = 1,2,3,---). Then, a, is Cauchy (since it is con- 
vergent in the metric space of all real numbers), but the only pos- 
sible limit (namely 0) lies outside of x. J 


In the metric space of all real numbers, with d(a,8) = [ἃ — Bl, 
every Cauchy sequence is convergent (this is the well-known Cauchy 
criterion for convergence). This is an example of an important type 
of metric space: 


Definition 2. A meiric space X is said to be complete in case every 
Cauchy sequence is convergent. Otherwise, X is said to be incomplete. 


Examples of complete metric spaces are given in the exercises, and 
elsewhere in the sequel. The property of completeness owes its im- 
portance, to a certain extent, to the existence of prominent metric 
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spaces Which do not possess it; two examples of such spaces, both 
pre-Hilbert spaces, will now be given. 


Example 7. Let @ be the pre-Hilbert space of finitely non-zero se- 
quences (Hzample 1.3), with d(x,y) = || x — y||. It will be shown that 
Φ is an incomplete metric space, by exhibiting a Cauchy sequence that 
has no limit in ®. The proposed sequence 2, is 


τι = (1,0,0,-++) 
2 = (1,1/2,0, - : +) 
i3 = (1,1/2,1/3,0, - -*) 


Zn = (1,1/2,1/3,-++,1/n,0,°+*). 
For all n,p = 1,2,3,-::, 


Ι IP (0 ieee a" ) 
| Tr - ta = | add —— 9 ἢ #*# & 5 - a EAS 
mae Sarre ae ee ee | 


2 


since the series ir ae 1/k? is convergent, d(tn4p,¢n) = || Tn4p — Xn || 
— JOasn — o. It follows that x, is a Cauchy sequence of vectors. 

Suppose (to the contrary) that ® contained a vector x = (Aj,A2,Az, 
ὧς - Awv,0,0, - : -) such that In —> &. If " = N, 


len — ἢ = ΣῚ 


1 2 
k a ont DP? 


= + 0; 


1 

_ 
eee 2 

letting n — @, ΣΤ [τ -- | = 0, hence λᾳ = 1/k for all k. This 

contradicts the assumption that is finitely non-zero. J 


Example 8. Let @ be the pre-Hilbert space of continuous functions on 
a closed interval, say [—1,1] for simplicity. Distances are defined as 
in Hxample 1: 


1 ἐξ 
dew =N2-y=(f ie - sora) . 
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It will be shown that @ is an incomplete metric space. The proposed 
Cauchy sequence without a limit is the sequence whose n’th term 7, 
has the following graph: 


Yn, 


[Ὁ for —1<t<0 
z,(t) = πὲ for 0<i<li/n 
1 for i/nst<l1. 


By elementary calculus, || tm — tn || — 0as m,n — ©; specifically, 
ifm > n, 
1 | (m a) n)* 
fF lent - 20 Pat = =. 
.-Ἰ om Th 
Assume to the contrary that Φ contains a (continuous) function z 
such that z, — 2, that is, 
I 
f \aa(t) — x(t) |2 dt — 0. 
=i 
Since the integrands are >0, 
b 1 
f |an(t) — x(t) |? dt < [ ᾿πβ(ὃ — a(t) |? αἱ 
“a -ἰ 
for any sub-interval [a,b] of [—1,1], hence 


ὃ 
f leno -- 2 Pat > 0. 

In particular, | ; 
[le -- 20 P at 0; 

- 
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since Z,(é) = 0 on [—1,0], this reduces to 


0 
ff izoPat > ὁ, 
in other words, A 
| x(t) |? dt = 0. 
-ἰ 


Since 2 is continuous, it follows that χί) = 0 on [—1,0]. 
Suppose 0 < « < 1. One has 


1 
[ [σε(ὃ — x(é)|? dt > 0. 
é 
But, τρί = 1 on [e,1], provided n > 1/e, hence 
ἽΝ 1 
fin - ὠμὰ - [τ-- το αι 
for n > 1/e; letting n — ©, it follows that 
1 
f [1 — x(t) [3 dt = 0, 
hence x(é) = 1 on [e,1]. Since e > 0 is arbitrary, r(¢) = 1 on the semi- 
open interval (0,1). Thus, 
0 for —1<i<0 
a(t) = 
1 Ὁ δεἐς 1. 

This contradicts the assumption that zis continuous. J 

Returning to general metric spaces, 
Theorem 1. In any metric space: 

(1) |d(z,2) vi d(y,z) | < d(x,y) 

(2) Ifa, > cand yn — y, then d(%n,Yn) > d(x,y). 

(3) Ifa, and yn are Cauchy sequences, then d(Xn,Yn) 18. a convergent 

sequence of real numbers. 

Proof. 
_ (1): By the triangle inequality, d(z,z) < d(x,y) + d(y,z); transpos- 
ing, d(x,z) — d(y,z) < d(«,y). Interchanging the role of z and y, 
ad(y,z) τ' d(x,z) < d(y,2), that is, —d(x,y) - d(x,2) ἫΝ d(y,2). 
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(2): Using part (1), and the triangle inequality for real numbers, 
|d(z,y) — A(en,Yn)| S|d(a,y) — d(en,y)| + |dG@n,y) — €@n,Yn)| S 
d(x,%n) + d(y,yn) ~ 0+ 0asn -- ὦ. 


(3): Similarly, |d(2m,%m) — (%nYn)| < @(tm;tn) + A(Ym;Yn) — 0 
+0 as m,n — οὐ, Since the real numbers are complete, the Cauchy 
sequence d(x,,Yn) converges. ἢ 


The concept of a bounded set of real numbers can be generalized 
as follows: a set ὃ of points in a metric space is said to be bounded in 
case the distances d(z,y) remain bounded as x and y vary over 8; that 
is, there is a constant M > 0 such that d(z,y) < M whenever z and 
y are points of 8. 


Examples 


9. Let zo be a fixed point of the metric space 9%, and let ε be a fixed 
real number >0. The set 8, of all points z in 80 such that d(z,%) < «, 
is bounded; for, if xz and y are two such points, d(x,y) < d(x,%) + 
d(xo,y) < 2e. 8 is called the open ball with center x» and radius ε. In 
particular, when % is the metric space of real numbers, $ is the open 
interval (1% — €,% + €). 


10. Similarly, the set 3 of all points x such that d(x,7) < ε is 
bounded; it is called the closed ball with center τὸ and radius ε. In 
particular, when 90 is the metric space of real numbers, 3 is the closed 
interval [zp — €,2% + εἰ. 


11. Suppose τὸ is a fixed point of the metrie space X, and 8 is a 
set of points of 9. Then, 8 is bounded if and only if the numbers 
d(x,29) are bounded as z varies over 8. For, suppose d(z,%9) < Καὶ for 
all z in 8; if 2 and y are points of 8, d(x,y) < d(x,%) + d(%,y) < 2K, 
thus 8 is bounded. Conversely, suppose 8 is bounded, say d(x,y) < M 
for all x and y in 8; fix any point yo of 8; then d(x,29) < d(z,yo) + 
A(yo;%o) < M + d(yo,%o) for all x in 8. 


Exercises 


1. In a discrete metric space (Example 3), x, — x if and only if 
there is an index N such that 2, = αὶ for alln > N. 

2. In the metric space X of Example ὃ, τὰ —> x means that z,(t) τὸ 
z(t) uniformly for ¢ in [α,}]. 
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3. In the metric space C” (see Example 1), x, — x if and only if: 
for each k = 1,-+-,m, the k’th component of x, converges to the k’th 
component of x. 


4. In the metric space C”, a sequence z, is Cauchy if and only if: 
for each k = 1,---,m, the sequence of k’th components is a Cauchy 
sequence of complex numbers. Deduce that ©” is complete. 


5. Every discrete metric space is complete. 
6. The metric space X of Example 5 is complete. 


7. If Sisa non-empty set of points in a metric space &, the follow- 
ing conditions on ὃ are equivalent: 

(a) 8. is bounded; 

(Ὁ) given any point z of &X, there is a ball centered at z (i.e. there 
exists a suitable radius ε > 0) which includes all the points of 8; 

(c) there exists a ball containing all the points of 8. 


8. In a metric space, the following statements are equivalent: 

(a) Zn — 2; 

(b) each ball centered at x contains all but finitely many of the 
terms Zn} | 

(ec) each open ball containing z contains all but finitely many of 
the terms Zp. 

In part (6), can one replace ‘open ball” by “closed ball’’? 


§5. METRIC NOTIONS IN PRE-HILBERT SPACE; HILBERT SPACES 


Let us translate the general metric space concepts of the preceding 
section into pre-Hilbert space terms; the dictionary 18 Example 4.1: 


1. A sequence of vectors 2, converges to the limit vector x in case 
| ct, —z|| τὸ 0 as n — ©; that is, given any « > 0, there is an 
index N such that || x, — x || < ¢ whenever n > N. The vector z is 
then uniquely determined by the sequence z,. Notations: z, — 2, 
Oz, > rasn — ©,orz = limp, etc. ... 


2. A sequence of vectors x, is convergent if there exists a vector x 
such that ὦ, — x. Otherwise, the sequence is divergent. 


3. A sequence of vectors 2, is Cauchy in case || 2m — zp || — 0 as 
m,n -- o; that is, given any « > 0, there is an index N such that 
|| tm — 2, || < ¢ whenever m,n > N. Every convergent sequence is 
Cauchy; the converse fails (see Examples 4.7 and 4.8). 
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4. A set 8 of vectors is bounded if there is a constant MM > 0 such 
that || x || < M for all x in 8 (see Example 4.11). In particular, a se- 
quence of vectors x, is bounded if there is a constant 1 > 0 such 
that || x, || < M for all n. If x9 is a fixed vector, and ¢ > 0, the set 
of all vectors x such that || « — zy |! < €is the open ball with center x 
and radius ε; the closed ball, with center xp and radius ε, is the set of 
all vectors x such that [ἃ — 2 || < ε; the set of all vectors x such 
that || 2 — 29 || = ¢ is called the sphere with center x) and radius e. 
In particular, the open unit ball, closed wnit ball, and unit sphere are 
defined, respectively, by the conditions || x [ < 1, || z |] <1, and 
{[ τ = 1. 


5. A pre-Eilbert space is complete in case every Cauchy sequence 
converges. That is, if || ἀρὰ — 2, || — 0, there exists a vector x such 
that || z, — x || - 0. 


Definition 1. A complete pre-Hilbert space is called a Hilbert space. 


In the sequel, the letters σὺ and *® will invariably denote Hilbert 
spaces. The most important example is the following: 


Example 1. The Hilbert space 17. Denote by 3¢ the set of all sequences 
4 = (λμ) of complex numbers which are absolutely square-summable, 
that is, D7) |Ax|? < ὦ (equivalently, the finite sums >>, |x|? have a 

| M4 
bound independent of n). For such an 2, define N(x) = (x7 AK x 
If x = (Ax) and y = (μι), write x = y in case Ay = με for all k. 


Lemma 1. If « = (Ax) and y = (us) are sequences belonging to i, 
then so is the sequence (Az + uz), which is denoted x + y. 


Proof: 
By the parallelogram law for complex numbers, |Az + ue|? + 
[Az — wel? = 2|Ax|? + 2| ux]?, hence 


Dolan + wel? <2 Dj dg|? +2 ΣΝ 


for all n. Clearly 37" |\x + #x|? « %, by the “comparison test.” ἢ 


If x = (Ax) belongs to 3C, and ἃ is a complex number, Dal AAR 2 = 
|r|? a x |” shows that the sequence ()A,) is absolutely square-sum- 
mable; it is denoted Az. 
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It is easy to see that 0 is a vector space, with respect to the opera- 
tions x + y and λα. Incidentally, N(Ax) = || N(x), and the proof of 
Lemma 1 shows that N(a + y)? + N(@ — y)? = 2N(z)? + 2N(y)?. 


Lemma 2. If ᾧ = (Ax) and y = (ux) belong to JC, the series yp a λεμιῖ 
converges absolutely. 
Proof: 

If aand b are real numbers, (a — b)? > 0 leads to ab < 3(a® + b*); 
in particular, |Ague*| =|Ax| | wx] < 3(|Ael? +] ae[?), thus ΣΝ | Ague*| 
converges by the comparison test. 


Lemma 2 justifies the definition 


(aly) = do) λέω", 


The axioms for a pre-Hilbert space are easily verified. Incidentally, 
| z || = N(@). 

To show that 3¢ is a Hilbert space, it remains only to prove com- 
pleteness. Suppose z',z”,23,--- is a Cauchy sequence in δῦ, that is, 
|<” — zx" || -- Oasmn — οὐ, Say x” = (Aj). Foreach k, [λζ" — XZ)? 
- 2 a | — AF? = || 2" — 2” ||? shows that the sequence Aj,Az,Az, °° * 
of k’th components is Cauchy. Since the complex numbers are com- 
plete, 4; — Ax as ἢ — ©, for suitable ἂρ. It will be shown that 
2 Be lA |? « ὦ, and that αὖ converges to x = (Ax). 

Given ¢ > 0. Let p be an index such that || 2” — x” ||? < « when- 
ever m,n > p. Fix any positive integer r; one has 


Dy laF — MP? < {| 1 -- 2" |? Se, 
provided m,n > p; letting m -- ~, 
Σ ἶλε — MI? « ε 
provided n > p; since r is arbitrary, 
«*) 2 Na -- "5 < «, whenever n> p. 
In particular, 5>°|\x — A2|? < ε, hence the sequence (Ax — X¥) be- 
jJongs to σοῦ; adding to it the sequence (Af) of 5C, one obtains (Ax), 


thus x = (A,) belongs to 50. It follows from (*) that || ὦ — τῇ ||? < ε 
Whenever n > p. Thus, αὖ — z. 


LL Se ee a ee yee τς 
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The Hilbert space 50 of absolutely square-summable sequences is 
denoted /?. J 


We resume the discussion of general pre-Hilbert spaces by noting 
some properties of Cauchy and convergent sequences of vectors: 
Lemma. Every Cauchy sequence is bounded. 

Proof. 

Given a Cauchy sequence 2p, let N be an index such that || 2, — 2m || 
< 1 whenever m,n > N. If n > N,|| ta || = || (tn — tw) + tn || < 
| t_, — ty || + | zw || <1+ Vex ἢ: Thus, if M is the largest of 
the numbers 1 + || xy {}}} 21 [I,l| 2 ll,->+sl] eva |], one has || zn || < 
M foralln. J 


Theorem 1. /n any pre-Hilbert space: 
(1) If zt, — zand y, — y, then (2n|\Yn) — (rly). 


(2) If x, and yp» are Cauchy sequences of vectors, then (τῳ [ Yn) 18 α 
Cauchy (hence convergent) sequence of scalars. 


Proof. 


(1): For all π, (%n|yn) — (ly) = @n — αἴ ψαη — y) + lyn — y) | 


+ (x, — x|y). Using the triangle inequality for complex numbers, 
and the Cauchy-Schwarz inequality, one has |(tnlyn) — (rly)| < 
[] 50) --- 2 lon—yvil+izilllgan—yll+llan—2]] lly]; clearly 
the right hand side — Oasn — οὐ, 


(2): Similarly, | (%nlyn) — (tm | Ym) |< |] en — τὰ || |] Yn — Ym || + 
l| 2m Il |] Yn — Ym || + |] 2a — Lm {{Π|| Ym || for all m and n; since || 2m || 
and || ym || are bounded by the Lemma, the right side — Ὁ as 
mn —> 0, 


Corollary. Jn any pre-Hilbert space: 
(1) If t, — &, then || zp || > || }}. 
(2) If x, is Cauchy, then || x, || converges. 


Exercises 
1. Fill in the details in Example 1. 


2. If St is a linear subspace of a pre-Hilbert space ©, and ὃῖ con- 
tains a ball or a sphere, then 9t contains every vector of @. 
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3. In a metric space, every Cauchy sequence is bounded. 

4, If t, — θ and y, is bounded, then (z,|y,) — 0. 

5. Another proof of the Corollary to Theorem 1 results from the 
inequality | || z|| — |lyll| |< 1} « -- yl. 

6. If || x, || — || 2 ||, and (ὦ, 1] — (2|z), then ὦ, — z. 

7. The argument in Example 4.7 can be concluded as follows. 
Assume to the contrary that there is a vector x = (\;) such that 


In τσὶ 1. Let = (1,0,0, - - *), ὅθ = (0,1,0,---), es = (0,0,1,0,---),++>. 
For each k, (a, [ 6.) — (x|ex) = Ax. But (ap|e,) = 1/k for n => k. 


§6. ORTHOGONAL VECTORS, ORTHONORMAL VECTORS 


Definition 1. /f x and y are vectors in a pre-Hilbert space, one says that 
z is orthogonal (or “perpendicular’”’) to y in case (x|y) = 0. Notation: 
τιν. 

Suggested verbalization of x L y: “x perp y.” The relation οἵ ortho- 
gonality is symmetric: if « 1 y, then y 1 2; this results from (y|z) = 
(x|y)*. If x 1 ἃ, necessarily ἃ = @. Every vector x is orthogonal to θ. 


Theorem 1. Jf x is orthogonal to each of ;,-++,Yn, then x is orthogonal 
to every linear combination of the yy. 


Proof. 


lfz 1 y forall k, andy = pie Axye, then (x| y) = ao An *(x| yx) = 
DU eO=0. ἢ 


Definition 2. A set 8 of vectors is said to be orthogonal in case x 1 y 
whenever x and y are distinct vectors in 8. A sequence (finite or infinite) 
of vectors Z, ts called an orthogonal sequence if x; | x, whenever 
j γέ k. 


Examples 


1. Let 4,A2,A3,°*- be any sequence of scalars. In the mtn 
space * finitely non-zero sequences, define x; = (A,,0,0,---), a = 
(0,A2,0,--+), 2g = (0,0,A3,0,---),-++. Then, 2, is an orthogonal se- 
quence of vectors. 


2. In the unitary space δ, the vectors 7; = (1,2,2), #2 = (2,1,—2), 
= (2,—2,1) are orthogonal. Incidentally, || x, || = 3 for all k. 
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3. In the pre-Hilbert space of continuous functions on [—7,7], let 
the sequence of functions z, be defined by the formulas z,() = sin (nt) 
(n = 1,2,3,-+-). The sequence z, is orthogonal, that is, 


J sin (nt sin (nt) at = 0, when m γέ ἢ. 


Similarly the sequence yn(t) = cos (nt) (n = 0,1,2,3,-++) 18 orthogo- 
nal. Moreover, 2m L yn for all m and n. 


Theorem 2. (Pythagorean relation) If x 1 y, then 
[5 Ἐν [5 =i? + ly /P- 
More generally, if 21,--+,%n are orthogonal, 


I] ΣΤ ae |]? = Dy Ul ae I. 


Proof. 

Ifzly|ie¢t+yl? = Glo + @ly + (15) + Gly ΞΞ [[ 11" + 
0+0-+ || y|I*. This is the case πὶ = 2. Assume inductively that 
| my 2, ||? = a || 2; ||?. Setting x = » eee aj, and y = Zp, one 


has x 1 y by Theorem 1; then, || D0) σα ||? = || z+ y |)? = lel? + 


ty l2= dD Weel? + onl? ὁ 


Corollary. If 21,22,03,--- is an orthogonal sequence (finite or infinite) 
of non-zero vectors, the x, are linearly independent. 


Proof. 

Suppose ΧΙ ete = θ. The vectors \121,°**, Ann are clearly or- 
thogonal. By Theorem 2,0 = || 6 ||? = 0) |i Avwe ||? = 20; |Ael? Il ae ll?5 
since || x; || > 0 for all k, necessarily \, = Oforallk. ἢ 


Definition 3. A set 8 of vectors in a pre-Hilbert space is said to be ortho- 
normal in case (i) $ is orthogonal in the sense of Definition 2, and 
(ii) || x || = 1 for every vector x in 8. A sequence (finite or infinite) of 
vectors x, is called an orthonormal sequence if (i) x; 1 x, whenever 
2 # k, and (ii) || v || = 1 for all k. 


The condition for the orthonormality of a sequence can be ex- — 


pressed as follows: (x;|2,) = 4j,- In general, the “Kronecker delta” 
symbol διε can be defined for 8 and ¢ varying over a set 5; it has the 
value 0 when 8 γέ ὁ, and 1 when 8 = ¢. For example, in the notation 
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of Exercise I.7.4, xs(t) = 6,,. A set S of vectors is orthonormal if and 
only if («|y) = δὲν for all and y in 8. 

Examples 


4, If y, is any orthogonal sequence of non-zero vectors, the se- 
quence 2, = || yn ||~'yn is orthonormal. For example: 


5. In the unitary space 65, the vectors (1/3,2/3,2/3), 
(2/3,1/3,—2/3), (2/3, —2/3,1/3) are orthonormal (see Example 2), 


6. Notation as in Example 3. One has || y, ||? = 27, and || 2, ||? 
= || yn ||? = π for n = 1,2,3,---. Define 


Un(t) = -- sin (nt) (n = 1,2,3,---) 


1 
v(t) = / dq 
υκ() = -- cos (nt) (πΞ 1,2,8,---). 


Then, the vectors %m,¥, are orthonormal. 


7. In the pre-Hilbert space of finitely non-zero sequences, let e; 
ig (1,0,0, +++), θὰ = (0,1,0,---), es = (0,0,1,0,---),---. The sequence 
of vectors e, is orthonormal. 


8. In the Hilbert space 15 (Example 5.1), let e, be the orthonormal 
sequence described in Example 7. If x = (A,), evidently (xe) = dg. 
In particular, for every vector 2, peg (| e,) |? < 0; this result holds 
= any orthonormal sequence, as a consequence of ‘Bessel’s inequal- 
ity’’: 

Theorem 3. (Bessel’s equality and inequality) Let x,,---,2, be ortho- 
normal vectors in a pre-Hilbert space. For every vector x, 


(1) [|« - ae (x| xx) ax |? = || x [3 — Σ | ez) Γ 
hence 


(2) ΣΤ (elzx) |? < || 2 IP. 
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Proof. a : 

If Xy,-::,An are arbitrary complex numbers, || >), Axe ||? = 
ἘΣ l] Age ||? = Dl Ae? by Theorem 2; calculating as in the proof of 
Theorem 3.4, 


I] 2 — Doe λιο [5 = ἢ} « [5 — (0, devel 2) 
— αἰ OF deme) + DOP el? 
= [1 [3 — D0) λεί! ax)" 
— Σ΄ (al ai)ae* + Dy eet 


= [ 2} — Dol lan) |? + D1 lee) — xP. 


In particular, setting \; = (x| 2x), this reduces to the relation (1); the 
inequality (2) follows at once. ἢ 


Corollary. If 21,%2,%3,:*- is an orthonormal sequence, then for any 
veclor x, 


Dd, | lax) /? < I 2 Il’. 
In particular, (α [ ἀκ) -- Ὁ α8 Καὶ --ὐ ~. 


Proof. 
Bessel’s inequality holds for each n. ἢ 


Remarks 

1. From the proof of Theorem 8, it is clear that the choice Ax 
= (ἡ [τι minimizes || x — 7. Azz |], and thus provides a “best 
approximation” of x by a linear combination of 11," + -,tn. Moreover, 
only one set of coefficients gives best approximation, namely A, 
= (x|x,). Note that if n > m, then in the best approximation by 
σι," " “γἴρν the first m coefficients are precisely those required for best 
approximation by 71," "γι. 


2. Notation as in Theorem 3. Let y = Pe (x|z,)a, and2z = ἃ — ἡ. 
Clearly (z|x,) = 0 for all k, hence (z|y) = 0. Thus, one has a decom- 
position s = y + 2, where y is a linear combination of 21,:+*,n, abe 
z 1 a; for all k. Such a decomposition is easily seen to be unique. 
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3. Bessel’s inequality for nm = 1 is essentially the Cauchy-Schwarz 
inequality; see the proof of Theorem 3.4. 


4, If με — 6, then (z|y,) — (x|@) = Ὁ for each vector x, by Theorem 
5.1. The converse of this proposition fails; for, with notation as in the 
above Corollary, (x|x;) — 0 for each x, although || 2, || = 1 precludes 


τι — 0. 


Example 9. Notation as in Example 6. The scalars ἂρ = (x| ug) 
(k = 1,2,3,---) and uz = (x|v,) ( = 0,1,2,3,---) are called the Four- 
ier coefficients of the function x. By the above Corollary, 


Do, lawl? + Σ οἱ με «[ | x(t) |? dt. 


[It can be shown that the sum is actually equal to the integral.] J 


Every orthonormal sequence of vectors is linearly independent, by 
the Corollary of Theorem 2. On the other hand, there is a systematic 
procedure for “orthonormalizing”’ any linearly independent sequence: 


Theorem 4. (Gram-Schmidt orthonormalization procedure) If y;,y2, 
5." 5" ts ἃ sequence of linearly independent vectors in a pre-Hilbert 
space, there exists an orthonormal sequence 2 ,%2,%3,:++ such that 
[ty ,°++,0n] = [y1,°*+,Yn) for all n (that is, 21,+++,%_ generate the same 
linear subspace as ψ15" * * Yn). 


Proof. 

The vectors x, will be defined inductively. Let 2; = |! y; ||~*y,. 
Assume inductively that orthonormal vectors 2, ---,%,—, are already 
defined, in such a way that [x,,--+,2%] = [y1,-+-,ys] fork = 1,---,n —1. 
The desired vector x, must be a linear combination of y1,---,%/n, Or 
equivalently of x1,---,%n—1,Yn; moreover, it must be orthogonal to 
each of 2,-*+,2n—;. Guided by Remark 2 following Theorem 8, set 
2=¥y,— δ τὶ (yn|2%)2%; then, z is orthogonal to 24,--+,t__3. 
Define 2, = || z ||~!z; this is permissible, since z = @ would imply 
that y, is a linear combination of x1,--+,%,—;, hence of y1,-++,Yn—1, 
contrary to the independence of the y’s. The reader can easily verify 
that every linear combination of x1,-- -,2, is also a linear combination 
Of ¥1,--+,%m, and vice versa. ἢ 


The Gram-Schmidt procedure applies equally well to a finite 
Sequence ¥),+**,Y¥, of independent vectors, and leads to orthonormal 
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vectors 21,---,2, such that [21,---,v] = [y1,-++,ye] for = 1,-++,n. 
In particular: 

Corollary. If @ is a pre-Hilbert space of finite dimension n, @ has a 
basis 11,"" γι of orthonormal vectors. 

Theorem 5. Every finite-dimensional pre-Hilbert space is complete, hence 
as a Hilbert space. 

Proof. 

By the above Corollary, there is a basis 2, - - ;tn of orthonormal vee- 
tors. If ἃ = p a Axre, then || x ||? = ΣΙ} by Theorem 2; com- 
pleteness can now be proved using a simplification of the argument 
given for I? (see Example 6.1). ἢ 


Exercises 


1. The relation || 2 + y ||? = ll |? + ll yl? is equivalent to (z|y) 
+ (y|x) = 0; denote this relation by z Py. Then x 1 y if and only if 
x P dy for all complex λ. 


2. Let @ be the pre-Hilbert space of finitely non-zero functions on 


a set 5 (Exercise 1.3). For each point 8 of 3, let xs be defined as in. 


Exercise 1.7.4. Then, the xs are an orthonormal set of vectors. That 
is, (vs|%,) = 6,: for all 8 and ¢ in 3. 


3. In the pre-Hilbert space of continuous functions on a symmetric 
interval [—a,a], every odd function z is orthogonal to every even 
function y (see Exercise I.5.6). 


4. Starting with the vector σι = (1,2,2,4) in the unitary space Θ΄, 
construct an orthogonal set of vectors 2},22,%3,%4 such that || x, || = 5 
for all k (integer components, preferably). 


5. In the pre-Hilbert space of finitely non-zero sequences, ortho- 
normalize the sequence of vectors y; = (1,0,0,---), y2 = (1,1,0,--+), 
¥s = (1,1,1,0,-- .}»5 cts 

6. In the pre-Hilbert space of continuous functions on [0,1], ortho- 
normalize the first three terms of the sequence yn(/) = ἐπ᾿ 
(n = 1,2,8,...). 
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§7. INFINITE SUMS IN HILBERT SPACE 

In the Hilbert space l?, consider the orthonormal sequence e, 
described in Example 6.8. If x = (Ay,-++,Am,0,---) is a finitely non- 
gero sequence, clearly z = ΕΝ Axex; One could formally write z = 
=. Axex, With the understanding that A; = Ὁ for all k > m. 

Consider now an arbitrary vector x = (Ax) in 13. What sense can be 
made of the expression ἃ = Zs Axex? It is natural to define Doe ALC 
to be the limit of the sequence of ‘partial sums” y, = Apex; this 
limit exists, in fact yn — 2, since || 2 — ψ ||? = || (0,- - -,0,λ 

2. n+] 

dna) 8 = D2 lel? 2 Oasn  «, be Mee 

These considerations will now be generalized for arbitrary ortho- 
normal sequences in Hilbert space. 
Lemma. Lei 21,%2,73,--+ be an orthonormal sequence of vectors in a pre- 
Hilbert space, and )1,A2,A3,° ++ @ sequence of scalars such that ye lel? 
< ὦ. Define yn = ρα λκακ. Then, the sequence y, is Cauchy. 
Proof. 


Ι Ynep — Yn [5 = || Dongs Ave ||? = Doe? Uf aver |I? 


+ 
= alae? —0 a n-w 0 


Throwing in completeness, we have 
Theorem 1. Jf x, 18 an orthonormal sequence of vectors in a Hilbert space, 
and), is a sequence of scalars such that Del? « ὦ, then the sequence 
Un = Do Ante converges to a limit x, denoted x = Duy ARTE. 

More generally: 
Definition 1. If x1,v2,2,- «+ is a sequence of vectors in a pre-Hilbert space, 


Such that the sequence yn = pa x, converges to a limit x, one writes 


oe a LE. 


Thus, | oy lk το > x, || —> 0, by definition. The basic prop- 
erties of the infinite sums described in Theorem 1 are as follows: 
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Theorem 2. Let x, be an orthonormal sequence of vectors in a Hilbert 


space. Suppose = δ. Ante and y = Do) μπᾶ, im the sense of The- 
orem 1. Then: 


(1) (νὴ) = ΣΤ \nue*, the series converging absolutely. 

(2) (w| ae) = Ae 

(8) ll zl? = Dr lakl? = ΣΙ @l ae) P. 
Proof. 

(1): Let s, = on Apa, and t, = ΕΝ μεῖκ. By definition, 85 -- ὦ 
and t, — y, hence (8, 9) — (x|y) by Theorem δ.1. Since (s,|tn) = 
ee jue *(xj|2x) = Σὰ λέμε", one has (z|y) = DoyAnme*. Re- 


placing (Ax) by (|Ax|), and (ux) by (|x|), it is clear that the conver- 
gence is absolute (see also Hxample 6.1). 


(2): This is a special case of (1), with μα = land»; = Oforallj ¥ ἢ. 
(3): Takey = zin (1). J 


Exercises 

1. If z, is an orthogonal sequence of vectors in a pre-Hilbert space, 
such that ag || xz ||? < ©, then the sequence y, = ΣᾺ x; is Cauchy. 
What if Σὺν || 2x || < οἵ 

2. Let xz, be a sequence of (not necessarily orthogonal) vectors 
in a pre-Hilbert space, such that }/° || xz || < 0. Then the sequence 
Yn = = x}, is Cauchy. Hence, in Hilbert space, 0) zx exists. 

3. If x, is an orthonormal sequence, and p ae A, exists in the sense 
of Definition 1, necessarily >)" |\x|? < ~. 

4. Give an example of a sequence z, such that pas l| xp ||? < ὦ, 
but for which the sequence y, = re x; is not Cauchy. 

5. If y = PDs yr, and x L y, for all k, then x L y. 


6. (Generalized Pythagorean relation) Let x, be an orthogonal se- 
quence in Hilbert space, such that Σὺν || «x ||? <«, and form the 


vectorx = >), x, according to Ezercise 1. Then || o IF = dy ll ve ll. 
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7. Suppose z, is an orthonormal sequence in a pre-Hilbert space, 
and x is a vector such that || 2 |? = δ [(α[“ι}}8. Then 2 = 
bs, (x| DE) re. 


8. If ¥1,¥2,¥3,-** is a sequence of vectors in a Hilbert space 5C, such 
that every vector x is a linear combination of finitely many y;, then 
5¢ is necessarily finite-dimensional. 


§8. TOTAL SETS, SEPARABLE HILBERT SPACES, 
ORTHONORMAL BASES 


Suppose x, is an orthonormal sequence of vectors in a Hilbert space. 
Given any vector «, the scalars ἂμ = (x|2,) satisfy Σ᾿. |x|? < ~, 
by the Corollary of Theorem 6.3. According to Theorem 7.1, one can 
form the vector y = )/) Antz, and by Theorem 7.2, (ν} τι) Ξε λὰ = 
(x|a,) for all k. When can one conclude that y = z? In any case, 
(y — x|a,) = (y|ax) — («|a~) = Ofor all k. Thus, one could conclude 
y = x if the vectors x; had the following property: the only vector z 
which is orthogonal to every 2; is the vector z = 6. This leads to the 
following definitions: 


Definition 1. A set 8 of vectors in a pre-Hilbert space @ is said to be 
total in case the only vector z of ® which is orthogonal to every vector of 
8 is the vector z = 0. A sequence (finite or infinite) of vectors x, is called 
a total sequence in case: if z | τὰ for all k, necessarily z = 0. 


Examples 


1. In any pre-Hilbert space @, @ is itself a total set of vectors. For, 
if z | x for every vector z, in particular z L 2. 


2. If 8. is any system of generators (Definition 1.8.4.) for the pre- 
Hilbert space @, 8 is total. For, if z is orthogonal to every vector in 8, 
it is orthogonal to every linear combination of vectors in $; in partic- 
ular, z 1 z. 


3. In the Hilbert space 15 (or in the pre-Hilbert space of finitely 
non-zero sequences), the sequence of vectors δι = (1,0,0,---), δὼ = 
(0,1,0,.- - .), eg = (0,0,1,0,- - “7, «++ is total. So is the sequence ὦ; = 
(1,0,0, - j “); tt, = (1,1,0,- , *); 3 = (1,1,1,0,- P *); νας. 
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*4. In the pre-Hilbert space of continuous functions on [—7z,z], it 
can be shown that the functions 1,t,¢”,- - - form a total sequence. So do 
the functions w1,2/2,%g, * - -,Vo,¥1,V2,03,-+* described in Example 6.6. 


Definition 2. A sequence (finite or infinite) of vectors x, is called an . 


orthonormal basis for a Hilbert space 3C, if it is (i) orthonormal, and 
(ii) ¢otal. 

Not every Hilbert space possesses such a sequence; those which do 
are characterized in T'heorem 3 below. 


Example 5. The sequence 6, described in Example 3 is an orthonormal 
basis for the Hilbert space 15. This will be referred to as the canonical 
orthonormal basis of ?. ἢ 

If a Hilbert space 95 has an orthonormal basis z, consisting of 
infinitely many vectors, σὉ is an infinite-dimensional vector space, by 
the Corollary of Theorem 6.2. It follows that the z, cannot form a 
basis, in the sense of Definition I.7.2, for the vector space δῦ; that is, 
not every vector x can be expressed as a linear combination of the x, 
(see Exercise 7.8). 

On the other hand, suppose @ is a pre-Hilbert space possessing a 
finite sequence 2,---,%, Which is orthonormal and total. If x is any 


vector in Φ, the vector ἃ — oe (x|2,)a% is orthogonal to every zz, — 


hence is 6. Thus, αὶ = ἘΣ (x|2,)z,, Φ is finite-dimensional, and 
21,°**,fn is a basis for Φ in the sense of Definition 1.7.3. Clearly, in a 
finite-dimensional space, the concepts “orthonormal! basis” and ‘“‘basis 
consisting of orthonormal vectors” coincide. 

By the remarks at the beginning of the section, 
Theorem 1. Jf x, is an orthonormal basis for the infinite-dimensional 
Hilbert space ὅ0, then for each vector x one has τ 2), (a| 2x) 2k. 

Alternative descriptions of an orthonormal basis are contained in 
the following: 
Theorem 2. // x,, is an orthonormal infinile sequence in a Hilbert space 
50, the following conditions are equivalent: 

(a) The τῷ are an orthonormal basis. 

(bt) SOF} @l zx)? = || « 15, for each vector x. 


(c) Σ (“ [ πκ)ῶκ = x, for each vector x. 
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Proof. 

(a) emplies (c) by Theorem 1. 

(c) implies (b) by Theorem 7.2. 

(b) implies (a): For, if («|a;,) = Ὁ for all k, clearly x = @ by (Ὁ). ἢ 

The obvious finite-dimensional analogs of Theorems 1 and 2 are 
true; the proofs are elementary. 

Which Hilbert spaces possess an orthonormal basis? Such a space 
necessarily contains a total sequence; it will be shown in Theorem 3 
that this condition is also sufficient for the existence of an orthonormal 
basis. 

Definition 3. A Hilbert space is said to be separable if it possesses a 
total sequence (finite or infinite). 


Examples 


6. If δῦ is a finite-dimensional Hilbert space, every basis y,+ + +,Yn 
is total, hence 3C is separable. 


7. The Hilbert space 15 is separable (see Example 5). 
Theorem 3. The following conditions on a Hilbert space 3C are equivalent: 
(a) δὺ is separable; 
(Ὁ) δ has an orthonormal basis Zp. 
Proof. 
(b) tmplies (a): This is clear from Definitions 2 and 8. 


(a) implies (b): Suppose 2z;,22,23,--- is a total sequence in 50. By 
Theorem I.6.2, there is a linearly independent subsequence 7/1,%/2,%/3,°* " 
of the z,, generating the same linear subspace as the χε. The y; are also 
total; for, if a vector z is orthogonal to every με, it is orthogonal to 
every linear combination of the yz, hence to every δὲ (hence z = δ). 
By Theorem 6.4, there is an orthonormal sequence 2),22,%3,-++ gen- 
erating the same linear subspace as the y,. The x; are total, by the 
above reasoning; thus, the τὲ are an orthonormal basis. 

Incidentally, if σὺ is finite-dimensional, the independent sequence 
Yx must be finite, say y1,-+*,Ynj then x1,- " +,2, isan orthonormal basis. 
Both the y; and the 2; are bases in the sense of Chapter I (see the 
femarks preceding Theorem 1). ἢ 
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There is another frequently used definition of ‘“separable’”’ Hilbert 
space, equivalent to the one given in Definition 3; this material is 
sketched in Exercises 3-5. 


1. In the pre-Hilbert space of finitely non-zero sequences = (Ax), 
let 8 be the set of all veetors x such that >> (1/k)Az = 0. Then 8 is 
a total set (actually a linear subspace). 


2. If a pre-Hilbert space @ possesses a finite total set 21,-*-,%m, 
then Φ is finite-dimensional (hence is a Hilbert space). An independ- 
ent set 7/1,:*-,Yn is a basis if and only if it is total. 


3. A metric space is said to be separable if it contains a sequence 2, 
with the following property: given any point ὦ in the space, there is a 
subsequence Z,, converging to x. Such a sequence <, 1s called a dense 
sequence. Show that if x, is a dense sequence in a Hilbert space Je, 
then x, is a total sequence, hence 3¢ is a separable Hilbert space in the 
sense of Definition 3. 

ῬΑ If 50 is a Hilbert space of finite dimension ἢ, and 2,---,2, is 
an orthonormal basis, then the vectors of the form z = Ἂς VEC, 
where the y; are Gaussian-rational (see Chapter I, ὃ 8), can be 
enumerated in a sequence, and this sequence is dense. Thus, JC is a 
separable metric space in the sense of Exercise 3. [The sophisticated 
point is the enumeration.] 

*5, Suppose 3c is an infinite-dimensional Hilbert space, separable in 
the sense of Definition 8. Then 3€ is a separable metric space in the 
sense of Exercise 3. 

*§. Verify Example 4: 

*7. It can be shown that every Hilbert space contains a total ortho- 
normal set 3; such a set is called an orthonormal basis for the space. 
(However, it may not be possible to enumerate 3 in a sequence.| 
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§9. ISOMORPHIC HILBERT SPACES; CLASSICAL 
HILBERT SPACE 


Definition 1. A Hilbert space SC is said to be isomorphic with a Hilbert 
space K if there exists a function T which assigns, to each vector x in KK, 
one and only one vector Tx in K, in such a way that the following con- 
ditions hold: 

(i) If x and y are vectors of δ such that x ¥ y, then Tx # Ty. 
(ii) If zis any vector in K, there is a vector x in K such that Tx = z. 
(iii) T(ie+y) = Tx+ Ty, for all x and y in XK, 

(iv) T(t) = A(T2), for all x in 3, and all scalars λ. 

(v) (Τα ΤῊ) = (ly), for all x and y in δ. 

Such a function T is called a Hilbert space isomorphism of JC onto K. 


Isomorphie Hilbert spaces are in a sense “equal,’’ for an isomor- 
phism Τ' distinguishes between distinct points, and “‘preserves’’ sums, 
scalar multiples, and scalar products. One can think of ® as being 
essentially the space σὺ with a “‘tag” 7' attached to each vector z. 

Two types of separable Hilbert spaces were noted in Examples 8.6 
and 8.7: (1) for each n, the unitary space @" is a separable Hilbert 
space, and (2) the Hilbert space /° is separable. Up to isomorphism, 
these are the only separable Hilbert spaces: 

Theorem 1. Let σὺ be a separable Hilbert space: 

(1) 17 5 has finite dimension n, it is isomorphic with e”. 

(2) If 00 is infinite-dimensional, it is isomorphic with 13, 
Proof. 

Let us show (2); the proof of (1) is much simpler, and is left to the 
reader. 

Suppose 50 is infinite-dimensional and separable. An isomorphism 7’ 
of σὺ onto 12 will be constructed. By Theorem 8.3, 5C has an orthonormal 
basis 2,,. 

Given any vector x in 50; the problem is to define Τὰ in 15. By the 
Corollary of Theorem 6.8, D0; | (x|xx)|? < ©, hence we may define 
Tx = ((x|2z,)); that is, Τὰ is the sequence whose k’th term is (2| zx). 
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Incidentally, x = δὴν (x|2x)2x, by Theorem 8.1. Let us verify the 
properties (i)-(v) of Definition 1: 


(i): Πα τέ y, then x — y γέ 6; since the στρ are total, (x — y|x;) γέ 0 
for some index 7, thus (x|x;) ¥ (y|2;). It follows that Τὰ τέ Ty, by 
the definition of equality in 15 (see Example 6.1). 


(ii): Given any vector (Ax) in ?. Define x = 3° κῶν as in The- 
orem 7.1. Since (x| xx) = Ax by Theorem 7.2, one has Tx = (d,). 


(iii): If x and y are vectors in δῦ, (x + y|x,) = (x| az) + (y| ax) for 
all k; this shows that T(z + y) = Tx + Ty. 


(iv): Similarly, ΤΊ) = A(T) results from (Ax| zz) = A(x| 2). 
(v): If x and y are vectors in 3C, then x = ja (x|a,)a;, and y = 


>; (y|xx)zx, hence (x|y) = Dd, (z|ze)(y|ax)* by Theorem 7.2. In 
other words, (z|y) = (Tzx|Ty). J 


Thus, up to isomorphism, there is just one infinite-dimensional 
separable Hilbert space: 


Definition 2. Any infinite-dimensional separable Hilbert space will be 
referred to as classical Hilbert space. 
Exercises 


1. If # and & are classical Hilbert spaces, there exists an isomor- 
phism of Ὁ onto K. 


2. If T is an isomorphism of a classical Hilbert space 920 onto a 
Hilbert space &, then X is also classical. 


3. If 60 is isomorphic with K, then XK is isomorphic with 3. 


Chapter 


Closed Linear Subspaces Π} 


§ 1. Some notations from set theory 
§ 2. Annihilators 

§ 3. Closed linear subspaces 

§ 4. Complete linear subspaces 

§ 5. Convex sets, minimizing vector 
§ 6. Orthogonal complement 

§ 7. Mappings 

§ 8. Projection 


§1. SOME NOTATIONS FROM SET THEORY 


A set %C is composed of objects 2,y,--- called the elements or members 
of the set x. As in the first two chapters, I assume that these terms 
are meaningful to the reader, without further elaboration. 

The statement “x is a member of the set {C’’ is symbolized “x € 9,” 
which may be read “x belongs to X.” For example: if Ὃ is a vector 
space, the symbol  € Ὃ means that z is a vector in the space U; if 
% is a metric space, x ΕΞ %X means that z is a point of the space X; 
λ € Ὁ means that ἃ is a complex number. 

If x € Xand y € ὃ, the statement “« = y” means that x and y are 
symbols representing the same element of X. One assumes the follow- 
ing properties of equality: (1) ἃ = x for every 7 CX; (2) if z= y, 
then y = zx; and (3) if ὦ = yand y = z, then z = z. These are known 
as the reflexive, symmetric, and transitive properties of equality. 

In addition, the following set-theoretic concepts and notations will 
be employed in the sequel: 


1. Let 9% and be sets. If every element of 9 is also an element of 

Y (that is, if ας ΕΞ ἃ implies x € Y), one says that X is a subset of Y. 

Other terms for this: X is contained in Y, X is included in Y, Y includes 
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Xx, Y contains X, Y is a superset of X. Notations: X C Y, or Y 39 &. 
For example, if δὶ is the set of all real numbers, and 8 is the set of real 
numbers in the interval [a,b], then ὃ C &. 


2. If both x C ¥Y and Y C &, the sets X and ¥ are said to be equal. 
This means: x € Xif and only if z € Y. Notation: 80 = Y. For ex- 
ample, if 90 is the set of all real numbers >0, and ¥ is the set of all 
real numbers which can be expressed as the square of a real number, 
then X = Y. 

3. Let ἃ be a set, and suppose that for each « € & there is given 
a statement involving x (denoted, say, by s(x)), which may or may 
not be true. The symbol 


{z Ε ἃ: s(z)} 


stands for the set of all 2 € 9% for which s(z) is true. For example, if 
δὶ is the set of all real numbers, and 8 is the interval [a,b], then 


S={reERiaczr<sbd}. 


4. A set ἃ is empty if it has no members (that is, the relation 
x € 2X does not hold for any x). Otherwise, X is said to be non-empty. 
The symbol @ denotes an empty set. For example, 


{fs R:x>2andz<l}=@, 
fc δι: 2? < 0} = @. 


5. The set whose only element is x is denoted {x}. The set whose 
only elements are 1,---,2, (not necessarily distinct) is denoted 
{1,+++,%}. Thus, αὶ €{2,--+,%,} if and only if x = x for some k. 


Exercises 

1. If X, Y, Z are sets, 

ἢ ἂς ἃ; 

() ἢ ἃ CYand ἃ C Z, then X C Z; 

(iii) 8. = ἃ if and only if both x C ἅ! and YC ἃ. 
2. If X, Y, Z are sets, 

(i) 8 - 8; 

(ii) if ἃ = αἱ, then ἅ! = 5; 

(iii) if © = Y and Y = Z, then X = Ζ. 
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3. If X and Y are both empty sets, then 9% = Y. Thus one can 
speak of the empty set. 


§2. ANNIHILATORS 


Definition 1. Let ὃ be a subset of a pre-Hilbert space @. A vector x ΕΞ Φ 
is said to be orthogonal to 8 in case x | 8 for all 8 € 8; notation: 
a L 8. The set of all such vectors x 18 called the annihilator of 8, and és 
denoted $+. Thus: 


8. = {x δ: (8) τ 0 forall 8 ( 8). 


Examples 


1. Recall that [0] is the set whose only element is the vector 6. 
Then, ®* = {6}. In fact, a subset 8 is total if and only if s+ = {6}. 


2. Clearly {6}+ = @. 


3. The convention is that @+ = @, where @ is the empty subset of 
Φ. The rationalization for this is as follows: given any vector z € @, 
one has (18) = 0 whenever 8 ΕΞ Οὗ (which is never). 


Theorem 1. If ὃ is any subset of a pre-Hilbert space Φ, then 8." is a 
linear subspace of ©. Moreover, if tn € 8. for all n, x ΕΞ Φ, and x, > 2, 
then x € 8". 


Proof. 

Clearly @ € S*, and S* is a linear subspace by Theorem II.6.1. 
Suppose t, 1 ὃ for all n, and ζῃ -- 2. For any 8 ( 8, (x|s) = lim 
(5,8) by Theorem 11.5.1; since (x,|s) = 0 for all n, (z/s)=0. ἢ 


Definition 2. Let 8 be a subsel of a metric space X. A point x € xX is 
said to be adherent lo 8 in case there exisis a sequence 8s, ΕἸ 8 such that 


Sy — J. 


_ In particular, if 8 is a subset of a pre-Hilbert space @, a vector x € @ 
is adherent to 8 if and only if there is a sequence 8, € $ such that 
lls, — «|| ~O0asn—. 


Definition 3. Let X be a metric space, 8  X. 8 is said to be a closed 
Subset of ὃ. if zt contains every point adherent to it. That is, the relations 
8) α 8,2 ΕΞ X, 8). — 2, imply x € 8. 
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Examples 


4. According to Theorem 1, if 8 is any subset of a pre-Hilbert space 
@, S+ is a closed linear subspace of 6. 


5. Let ® be the metric space of all real numbers (Example 11.4.3}, 
5 the “closed interval” [a,b], where a < b. Then 8 is a closed subset of 
δι. For, suppose a < sn < ὃ, and s, — 2, that is, [85 — z| — 0. The 
assertion is that a < « < b. Assume to the contrary, say, that x > ὃ. 
Then ¢ = x — b > 0, hence for sufficiently large πὶ, ὦ — sn < |e — Sn| 
<e=2—b, —s, < -- ὃ, 8, > ὃ, a contradiction. A contradiction 
is reached similarly if x < a. 

6. Let X be a metric space, y ( X, ¢ > 0, and ὃ = jx€ 8: d(x,y) 
< εἰ. Then, 8 is a closed set. For, suppose tp, C ὃ and ἃ, — 2. By 
Theorem 11.4.1, d(tn,y) — (x,y); since 0 < d(tn,y) S ε for all n, 
one has 0 < d(z,y) < εν Example 5. 


7. The convention is that the empty subset of any metric space is 
closed. For, if a is adherent to @ (which is never, since a sequence 
cannot be extracted from @), thenz ΕΞ ὦ. 

8. Every sphere {x: d(x,y) = εἰ is a closed subset. 

9. In a pre-Hilbert space, every closed ball {x: || « — y | < εἰ, and 
every sphere {z: || z — y || = εἰ, is a closed subset. In particular, the 
closed unit ball {x: || z || < 1}, and the unit sphere {z: || z || = 1}, are 
closed subsets. 

10. In a metric space X, ὃ. itself is a closed subset. 

11. Every finite subset 8 = {11,°-+,Ym} of a metric space X is 
closed. For, suppose z, € 8 and χη — 2; for some yj © ὃ, In = Yj 
for infinitely many n, hence x = y; € 8. In particular, every one-point 
subset {y} is closed. J 

If $ is a subset of the pre-Hilbert space @, one writes 8. for the 
annihilator of $+; thus, 8. ὁ = (S*)+. It is not necessary to define 
further “higher order perps,” in view of part (3) of the following 
Theorem 2. If $ and 3 are subsets of a pre-Hilbert space ®, 

(1 ὃς 8“; 
(2) $C δ implies 3+ C 5; 
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Proof. 


— @: Let x € 8. For any y € $*, one has y 1 2, hence x 1 $+, that 
is, ΕΞ ($+)*. Thus, z € Simpliesz € 851". 


(2): Suppose 8 C 3. If x L 3, then all the more x 1 8; thus, z € σ᾽ 
implies x Ε 81. | 


(3): Clearly (S*)*+ = (s++)+, since in each case one starts with 
$ and perps three times. Applying part (2) to the relation 8 Ὁ s*+, 
one has $+ > (5: 1)". Also, $* c (S+)++ by part (1). The last two 
inclusions combine to give the desired equality (3). J 


Definition 4. Jf S and 3 are subsets of a pre-Hilbert space @, one says 
that ὃ is orthogonal to 3 in case x L y whenever «x ΕΞ 8 and y € 3. 
Notation: ὃ .1 3. 

_ Evidently $ 1 $*, and {6} L 8, for every subset 8. The convention 
is that @ _ 8, for every subset 8. 


Theorem 3. Jf 3 and N are linear subspaces of a pre-Hilbert space ©, 
such that 511 L ON, then every vector x ΕΞ M + WM has a unique repre- 
sentation x = y-+2zwithy € Mandz € VW. 


Proof. 

See Definition I.5.2 for the notation. Suppose z = ψι + 2, = yg + 
Z2, Where yx © WM and z, ΕΞ M. The problem is to show that y; = ψὲ 
and z,; = zo. Define τὸ = y; — yo = 22 — z. Clearly w € Ot and 
w € W;since MW 11. RN, one hasw 1 w,hencew=6. ἢ 


Definition 5. Notation as in Theorem 8. One writes I @M for the 
linear subspace I + MN. Thus, the use of the symbol @ entails ortho- 
gonality of the summands. 


Exercises 


1. Let 8 be a non-empty subset of a metric space X, and z Ε &. 
Then, x is adherent to 8 if and only if: given any e > 0, there is at 
least one point 8 € 8 such that d(s,x) < e. 


2. If Ὁ is the metric space of all complex numbers (Zxample 11.4.9}, 
and (ἢ is the set of all real numbers, then Οἱ is a closed subset of 6. 
Also, the set 8 of all complex numbers ἃ such that |A| > « is a closed 
Subset of @. 
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3. Let 20 be classical Hilbert space (Definition 11.9.3), and 2, an 
orthonormal basis for 50. Let 9t be the linear subspace generated by 
the ἀρ, that is, the set of all finite linear combinations of the x. Then: 


(i) Every vector x € 3 is adherent to 2. 
(ii) Nis not a closed subset of 0. 


4, If $ and 3 are subsets of a pre-Hilbert space ®, the following 
statements are equivalent: 


() $13 
Gy δὰ" 
ie) Scs*. 


5. If $ is an orthogonal set (Definition II.6.2) of non-zero vectors, 
and 81, 85 are subsets of 8, then 81 .1 8. if and only if $; and 85 have 
no vectors in common. 


6. If 511 and 91 are linear subspaces of a pre-Hilbert space δ, then 
ας 1 9 if and only if || « + y ||? = || x ||? + || y |? whenever z € om 
and y € 2. 


7. If 9 and % are linear subspaces of a pre-Hilbert space @, such 
that 9% @ ot = Φ, then 911 = x, 91: = 911, hence ++ = ON and 
g++ = x. In particular, 9 and % are sloned ΤΕΣ subspaces. 


8. In the pre-Hilbert space Φ of continuous functions on [—a,q], let 
αὐ be the subspace of even functions, and 2 the subspace of odd 
functions (see Ezercise I.5.6). Then, 0 = IM (ὃ NM. 


8.3. CLOSED LINEAR SUBSPACES 


A linear subspace 9 of a pre-Hilbert space @ can be enlarged to a 
closed linear subspace by forming %** (see Theorems 2.1, 2.2). 
Another procedure for enlarging a linear subspace to a closed linear 
subspace is given in Theorem 2 below. It will be shown in § 6 that these 
two procedures yield the same closed linear subspace when @ Is a 


Definition 1. If 8 is a subset of a metric space X, the closure (or ‘“‘ad- 
herence’’) of 8 in 50 is defined to be the set of all points x ΕΞ X which are 
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adherent to 8 in the sense of Definition 2.2; it is denoted 8, Thus, 
ὃ = {x € X: xis adherent to $}. 
18 = &, 8 és called a dense subset of X. 
Remarks 
1. « € 8 if and only if there exists a sequence s, € 8 such that 


Sn - ἕξ, 


2. The convention is that @ = @. For, there are no points z ad- 
herent to @. 


3. 8 is a closed subset of δ if and only if 8 C 8 (see Definition 2.3). 


4. 8 C 8, for every subset 8. That is, every point 8 € $ is adherent 
to 8. (take s, = 8 for all n). 


5. 8 = $ if and only if $ is closed. 


6. If $C 3, then § C3. For, suppose x € 8. Say 8, — x, where 
S, © 8. Since also 8, € 3, x is adherent to 3. 


7. 80 is a dense subset of 80. If 8 is a dense subset of X, and 32> 8, 
then 3 is also a dense subset of ὃ, 


Theorem 1. Lei 8 be a subset of a metric space 9%. Then: 

(1) Sis aclosed subset of 2%. 

(2) ges. 

(3) If dis a closed subset of 9 such that 3 > 8, necessarily 5 39 8. 
In other words, ὃ is the smallest closed subset of % which contains 8. 
Proof. 


(1): Suppose y is adherent to 8. It must be shown that y € 8. 
Choose any sequence x, € 8 such that x, — y. For each n, choose a 
point 8, ΕΞ 8 such that d(sp,t,) < 1/n. Then, d(sp,y) < d(8n,2n) + 
U(2ny) S1/n + deny) > 0+ 0. Thus, s, > y, y Ε 8. 


(2): See Remark 4 above. 


(3): If S C 3, and 3 is closed, then 8 C ὅ = 3 results from Remarks 
Gand5. ἢ 
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Lemma. Jn a pre-Hilbert space: 

(1) Ifa, - cand yn — y, then n+ Yn > ὦ Ἔν. 

(2) Ifa, --ῷ xand dr» — A, then Antn — Az. 
Proof. 


(1): || @n + yn) -@+yl=@Ga-2)+@r-y Ils 
lla. — 2. ἘΠ νη -- ν}.-Ὁ 0.Ἐ0. 


(2): || Antn — Az || = || An — A) (Gn — Z) +AGn — α) + An — λ)2}} 
ΞΔ — Al? || en — α |] +)Al ll tan — 2 i] +[An — Al- [ “}} — 0-0 + 
[Δ1|-0 - 0.11} Ξ 0. ἢ 


Theorem 2. If It is a linear subspace of a pre-Hilbert space Φ, then Nt is 
a closed linear subspace of ©. Moreover, δ᾽, is the smallest closed linear 
subspace which contains Nt. 


Proof. 

Suppose z,y € 9, and ἃ is a scalar. Choose sequences z, € XN, 
Y¥n ΕΞ MNsuch thatz, — randy, — y. Then, tz, + yn, — z+ yand 
λα, — λα, by the Lemma. Since rz, + yn ΕΞ NM and Az, CN, r+ y 
and \z are adherent to 9, that is, they belong to 81. This proves that 
9 is a linear subspace; it is closed by Theorem 1. ἢ 


Corollary. Jf 91 is a linear subspace of a pre-Hilbert space ®, then 
xc 51... 


Proof. 
οι Ὁ is a closed linear subspace containing 9, by Theorems 2.1 and 
22. | 


Exercises 

1. In a pre-Hilbert space: if x, and y, are Cauchy sequences of 
vectors, and \» is a Cauchy sequence of scalars, then τη + y, and 
Ann are Cauchy sequences of vectors. 

2. If Sis a subset of a pre-Hilbert space, then $ and 8 have the same 
annihilator, that is, $+ = (8) ". 

3. If x, is an orthonormal basis for the classical Hilbert space 3, 
and δ᾽ is the linear subspace generated by the 2%, then 2 is dense in 50, 


4, Notation as in Exercise 3. Let 8 be the set of all vectors x which 
can be expressed as a linear combination of the x, with Gaussian- 


*] 
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rational coefficients. Then, 8 is not a linear subspace, but it is a dense 
subset of 3c. 


5. Let 8 and 3 be subsets of a metric space X. If every z € & is 
adherent to 5, and every ¢ © 3 is adherent to 8, then 8 is a dense subset 
of X. 


6. In a pre-Hilbert space Φ, the closure of {zx: || z — y || < εἰ is 
{x: || a — y || < e}. Ina metric space, the closure of {x: d(x,y) < εἰ is 
contained in {x:d(x,y) < e}. 


7. If Sis any subset of a pre-Hilbert space Φ, then 8 c 851. 


§4. COMPLETE LINEAR SUBSPACES 


The principal result of this chapter is the following theorem: if οἵ 
is a closed linear subspace of a Hilbert space 0, every vector z Ε 3C 
can be written in the form z = y + z with y € MN andz€ στ΄". This 
result 1s proved in §6, with the decisive lemma coming in § 5. The 
purpose of the present section is to introduce and illustrate a concept 
which plays an important role in these results: the concept of a 
“complete subset.” 

If 8 is a non-empty subset of a metric space 80, 8. is itself a metric 
space (Hxample IJ.4.4), it being understood that distances in 8 are 
measured as they already are in X. 


Definition 1. Let X be a metric space, 8 C 80. If 8 is a complete metric 
space, § is called a complete subset of X. This means: if s, Ε 8 and 
A(Smy8n) — 0, there is a point s in 8 such that d(s,,s) — 0. 


Remarks 
1. The convention is that @ is a complete subset of xX. 


2. Every closed subset $ of a complete metric space 2 is a complete 
subset. For, suppose 8, € 8, d(Sm,Sn) —> 0. Since X is complete, there 
is a point z ΕΞ such that s, — «x. Since 8 is closed, x € 8. 


3. Every complete subset ὃ of a metric space 2 is a closed subset of 
X. For, suppose sp € 8, ὦ Ε X, and sy — zx. Since d(sm,8n) — 0, and 
ὃ is complete, there is a point s Ε΄ 8 such that s, — 8. By the unique- 
ness of limits, = s 8. Thus, ὃ contains every point x which is 
adherent to 8. 


Combining Remarks 2 and 3, 
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Theorem 1. In a complete metric space (in particular, in a Hilbert space), 
a subset is closed if and only if it is complete. 


Corollary. If 91 is a closed linear subspace of a Hilbert space, N 1s vself 
a Hilbert space. 


The role of completeness is emphasized by the existence of theorems 
concerning closed subsets of Hilbert space which are valid for complete 
subsets of pre-Hilbert space. Important examples of this will be given 
in the next two sections; another is the following: 

Theorem 2. If 9 and σὲ are complete linear subspaces of a pre-Hil- 
bert space @, and M L MN, then M QBN is also a complete linear sub- 
space of Φ. 
Proof. 

See Definitions 2.4 and 2.5 for the notations. Let x, be a Cauchy 
sequence in IN + 9; say Tn = Yn + Zn, With yn € Mand z, € N. By the 
Pythagorean relation, || Ym — Yn ll? + I) 2m — 2n Ι = || (Ym — Yn) + 
(Zim — én) ||? = || (Ym + 2m) — (Yn + 2n) ||? = |] tm — an ||? — 0, hence 
Yn is a Cauchy sequence in 510, and zy, is a Cauchy sequence in 9ῖ. Since 
9 and 9 are complete, yn — y and z, — 2 for suitable y C ON and 
ΖΕ %. Then, zp = yn t Zn -τῦὸ y +2, by the Lemma to Theorem 
3.2. | | 
Corollary. 1 st and % are closed linear subspaces of a Hilbert space δῦ, 
and Ml .1 MN, then M (Ὁ WM is a closed linear subspace of 3. 

Tt will be shown in ὃ 6 that if οἵ is a closed linear subspace of a 
Hilbert space 3¢, then 5C = 91 (Ὁ M+. For finite-dimensional %, this 
15 easy: 

Theorem 3. Lei 0 be a finite-dimensional linear subspace of a pre-Hilbert 
space @. Then: 


(1) 92s complete (hence closed). 

(2) Φ- σι Ὁ 9". 

(3) 51 Ξ- 981. “". 
Proof. 

(1): 9 is complete by Theorem II.6.5. 

(2): Let y1,°*+,Yn be an orthonormal basis for 9t. Given any vector 
x & , define y = >, (| yx)yx and 2 = αὶ — y. Clearly 2 L yy for all 
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k, hence z 1 X. Thus, ὦ = y +2, with y CM and z Ε΄ 51. See 
Definition 2.5. 


(3): By Theorem 2.2, % C 5 Ὁ. Conversely, assuming « € "+, 
let us show that 2 € 51. By (2), we may write 1 = y +- z, with y € OM 
and z €& M+. Since both « € 9t** and y € at*+, one has z = 
a —y € ++; that is, z 1 9+, and in particular z L z. Thus, z = 6, 
andz=yeCun. ἢ 


Exercises 


1. Let 9 be a complete linear subspace of the pre-Hilbert space @, 
and suppose 91 is separable (Definition 11.8.8). Generalize Theorem 8. 
[One can even get along without separability; see Theorem 6.1]. 


2. If Φ is any pre-Hilbert space, @** = Φ even if Φ is incomplete. 
In the pre-Hilbert space of finitely non-zero sequences x = (Ax), let 
9 be the set of all vectors z such that A; = 0. Then 21 = ++, but 
9 is not complete. 


§5. CONVEX SETS, MINIMIZING VECTOR 


If x,y,z are any three points in a metric space, d(z,y) < d(x,z) + 
d(z,y). If d(a,y) = d(z,z) + d(z,y) for three particular points x,y,z, it 
is natural to say that “‘z lies on the segment joining z and μ᾽} (see, how- 
ever, Exercise 9). In pre-Hilbert space, this condition reads || « — y || 
= || 2 —2z|| + || z — y|| ; one can show that this condition is in turn 
equivalent to the existence of a real number a, Ὁ < a < 1, such that 
z= ax + (1 — a)y (see Exercise 11.8.8). The latter condition makes 
Sense In any vector space: 


Definition 1. Lei U be any vector space. If x and y are any two vectors, 
ihe segment joining x and y is the set of all vectors z of the form z = 
ax + (1—a)y, where0O<a<1. A non-emply subset 8 of Ὃ ts said 
to be convex if 8. contains the segment joining any two of tts vectors; 
that is, the relations x € 8, ΕΞ 8,0 <a < limplyax + (1 -- αν Ε 8. 


Examples 


1, Every linear subspace of Ὃ is convex. 


2. In a pre-Hilbert space, the closed unit ball 8 = {a: || # || < 1} is 
convex. For, ifz € 8,y ΕΞ $,and0 < α <1,then||/ae¢+(1—a)y||< 
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jar || - 11 G—eyll=ell¢|+a-a)|lyi]set+td—-a=1 
shows that az + (1 — a)y ΕΞ 8. 

Theorem 1. (Minimizing vector) Let 8 be a complete and convex subset 
of a pre-Hilbert space Φ. Given any vector « ΕΞ @; there exists one and 
only one vector y, ΕἾ 8 such that || xz — y. || < ||e«—y|| forall y C8. 
Proof. 

Let 6 = GLB { || « — y || :y ΕΞ 8}, and choose any sequence yn ΕΞ ὃ 
such that || z — yn || — 6. It will be shown that (1) y, is a Cauchy 
sequence, hence converges to a vector y, € 8 such that || « — yo || = 
δ, and (2) this equality determines y, € 8 uniquely. 


By the parallelogram law, || (Ym — 2) + (ὦ — yn) \|? + || Gm — 2) 
— (x — yn) ||? = 2| Ym — x 5 - 2} σ — yn ||”, hence 
ll Ym — Yn [15 = 2}} Ym — 2 [5 + 2 ||  — yn |? — || Gm + Yn) — 25 \|? 
= 2 {γι — ἁ |? -" 2|} 5 -- ll’ -- αἰ 5(Ym + Yn) στ ?. 
Since $ is convex, it contains the vector $(ym + Yn) = ἔν, + ἕνην 
hence || (ym + yn) — x || > ὃ by the definition of δ. It follows that 
ll ym — Yn I? <2 ll ym — τ |? +2 || -- yn ||? — 43°; 
as m,n — ©, the right member of the inequality -- 25? + 267 — 
46" = 0, thus y, is a Cauchy sequence in 8. Since 8 is complete, 
Yn — Yo for suitable y, ΕΞ 8. Since yn — Φ — yo — & by the Lemma 
of Theorem 3.2, one has || yn — z || — || yo — x ||. Thus, || yo — 2 || = 
ὃ 


Suppose also z, € 8 satisfies || 20 — x || = 6. Since $ contains the 
vector 4(Y%o + 2c), Yo = Zo results from the calculation 


Il Yo — Zo ||? = 2 || yo — # [5 + 2 || 2 — 2 ||? — || (Yo + %) — 20 ||? 
= 26° + 28° — 4 || δίψο + 20) — 2 |l? 
< 28° + 28? —487=0. J 
Quoting Theorem 4.1, 


Corollary 1. Let 8 be a closed and convex subset of a Hilbert space ὅθ, 
Given any x ΕΞ ὅ0; there exists a unique vector yo ἘΞ ὃ such that || ἃ — yo || 
Ξ [Πα -- ψ } forally cs. 


Corollary 2. Jf 8 zs a complete and convex subset of a pre-Hilbert space 
Φ, ὃ contains a unique vector yo of minimum norm. 
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Proof. 
Take x = @in Theoremi1. ἢ 


Exercises 


1. In a pre-Hilbert space: 

(i) The balls {z: || x |] < εἰ and {z: || 2 || < εἰ are convex. 

(ii) If 8 is convex, so is y + 8 (= the set of all vectors y + x with 
x € 8). 

(iii) The open and closed balls with center y and radius ε are 
convex. 


2. If 8 and 3 are convex subsets of a vector space, and λιμ are 
scalars, then 8 + 3, λϑ, and hence AS + μῦ are convex. 


3. In the pre-Hilbert space of finitely non-zero sequences, the set 
8 of all vectors z = (Ag) such that ἂρ is real and >0 is convex. Sim- 
ilarly in the Hilbert space 15. 


4. If $ is a non-empty subset of a metric space 8, and = is a point 
of X, the distance from x to 8 is defined to be the non-negative real 
number GLB {d(z,y): y € 8}; it is denoted d(z,S). Prove: z is ad- 
herent to 8 if and only if d(z,S) = 0; that is, 


$= iz € &: 5.3) = 0}. 


5. The conclusion of Theorem 1 may hold for certain non-complete 
convex sets. For example, let ® be any pre-Hilbert space, and ὃ = 
(x: || x || < 1}. 

(i) In the notation of Theorem 1, y, = αὶ when || z || < 1, and y, = 
|| z || 2a when |] x || > 1. 

(ii) Φ is complete if and only if 8 is a complete subset. 

(iii) Incidentally, Φ is complete if and only if {a: [{ || = 1} isa 
complete subset. 


6. Here is an example of Corollary 2. In the unitary space @”, 
let $ be the set of all y = (A1,--+,An) such that 30° ἂρ = 1. It is easy 
to see that $ is a closed (hence complete) convex subset of δῦ, The 
vector y, € 8 of smallest norm is y, = (1/n,:-+,1/n). 

7. If Sis not complete, the conclusion of Theorem 1 may not hold. 
For example, let Φ be the pre-Hilbert space of finitely non-zero 


Sequences x = (dx), and $ the set of all x such that Σὺν % = 1. It 
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is easy to see that 8 is convex. However, 8 does not contain a vector 
Yo of minimum norm. Incidentally, 8 is not closed. An example with 
$ closed is given in Exercise 4 of the next section. 


8. If $ is a convex subset of a pre-Hilbert space, then 8 is also 
convex. 


9. Let U be the vector space of continuous functions on [0,1] 
(Example 1.1.8}, regarded as a metric space via d(z,y) = LUB 
{|x(t) — y(t)|:0 <t <1} (see Lxample 11.4.8). Let x,y,z be the 
functions defined by the formulas x(#) = t, y(t) = —1, and z(t) = 0. 
Then, (i) d(z,y) = d(x,z) + d(z,y), but (ii) there does not exist a real 
number ἃ such that z = ar + (1 — a)y. 


§6. ORTHOGONAL COMPLEMENT 


Theorem 1. 17 51 8 a complete linear subspace of a pre-Hilbert space @, 
then Φ = N@Q στ, and R++ = 91. 


Proof. 
Given any vector x € @, let us show that there is a decomposition 
= y, +2 with y, € Nandz Ε 91}. 


Since 9 is complete and convex, there exists, by Theorem 5.1, a — 


vector yj, Ε 81 such that || 2 — y || < || x — y|| for all y ΕΞ N. De- 
fine z = x — yp; it will suffice to prove that z L %. 

Given any y € 2%, let us show that (y|z) = 0; there is no loss of 
generality in assuming || y || = 1. By the calculation made in the 
proof of Theorem 11.8.4, 

lle — Ay |? = lz)? — |@ly)|? + [@ly) — al’, 
for every scalar i. In particular, for A, = (z/y), 
(*) | 2 — roy [5 = || 2 |? — |@ly)I?. 


Now, 5 — Ay = (@ — Yo) — AY =  -- (Yo + Acy); Since Y + 
Aoy ΕΞ M, one has || xz — y, || < |] c — (yo + λοι) ||, by the choice of 
Yo. That is, || z || < || 2 — Avy ||; combining this with (ἢ), 

ll 2 (I? < lle — Ay [13 = ell? — ely)? < ell. 


Clearly (z|y) = 0. 
The proof of 91. ὁ = 9 is the same as in Theorem 4.3. ἢ 


Quoting Theorem 4.1, 
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Theorem 2. Jf 9t is a closed linear subspace of a Hilbert space ὅς, then 
50 = NO 51, and 91. ὁ = 9. 


Definition 1. If δὲ is a closed linear subspace of a Hilbert space 3, 
οὐ ὁ is called the orthogonal complement of 9. 


Remarks 


1. If 91 is a closed linear subspace of a Hilbert space 3c, the relation 
(9t+)+ = shows that the orthogonal complement of 91." is 9. 


2. If @ is a pre-Hilbert space such that every closed linear subspace 
οἵ of Φ satisfies the condition σὲ ὁ ὁ = 9, then Φ is necessarily com- 
plete (hence is a Hilbert space). See Hxercise V.1.3. This remark will 
not be used in the sequel. 


Theorem 3. Jf S is any subset of a Hilbert space 3, then $++ is the 
smallest closed linear subspace of 5C which contains 8. That is, 


(1) 8:1 is a closed linear subspace of 3C; 

Q) se 3"-: 

(3) if 9 is a closed linear subspace of KH such that 8 C M, neces- 

sarily $++ ΓΞ om. 

Proof. 

(1): see Theorem 2.1. 

(2): see Theorem 2.2. 

(3): Suppose 8 C 9, where S is a closed linear subspace of JC. 
Then, $+ D> σι, S++ C συ; quote Theorem 2. ἢ 
Corollary 1. 17 91 is any linear subspace of a Hilbert space ὅ0, δῖ, = nee, 
Proof. 

Compare Theorem 3 with Theorem 3.2. ἢ 


Corollary 2. Let It; be a sequence of closed linear subspaces of a Hilbert 
space 3C. There exists a smallest closed linear subspace St such that 
Nz CN for all k. One has x L N if and only if x L Ny for all k. 


Proof, 

Let $ be the set of all vectors « € 50 such that « ΕΞ ON, for some k. 
Clearly 9%; ΓΞ 8 for all k, and $ is the smallest subset of σὺ with this 
property. Set 91 = $++. Evidently 9% C$ C ® for all k. If 910 is a 
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closed linear subspace such that 9 C 910 for all k, then 8 C MM, hence 
9 C OM by Theorem 3. The last assertion follows from 91+ = 8. ὁ = 
8. (see Theorem 2.2). ἢ 


Exercises 
1. If 2X is a finite-dimensional linear subspace of a pre-Hilbert 
space Φ, another proof of Φ = 2 (Ὁ 2" (this is Theorem 4.3) can be 
deduced from Theorem 1 and Theorem II.6.5. Both proofs, at some 
point, make use of an orthonormal basis of 9. 


2. Let 9 be a linear subspace of a Hilbert space 50. Then 9% is a 
dense subset of σὺ if and only if it is a total subset of 50. 

3. In the pre-Hilbert space of finitely non-zero sequences + = (Ax), 
let 51 be the set of all 2 such that our (1/k)\y = 0 Then St is a 
closed linear subspace, but 9% γέ 27+. 

4. Completeness is essential for Theorem 5.1 (theorem on min- 
imizing vector). For example, let $ be a linear subspace of a pre- 


Hilbert space, such that 8 ~ $~~ (for instance, ὃ may be the closed 
linear subspace described in Hzercise 3). Then, Theorem 5.1 fails for 


the convex set 8. Indeed, if x is any vector which belongs to $** but _ 


not to 8, no minimizing vector y, exists. 


5. Let ® be a pre-Hilbert space possessing a total sequence 7» 
(Definition II.8.1). If 9t is a complete linear subspace of @, then 91 
also possesses a total sequence yn, hence is a separable Hilbert space. 


*6, (i) If X is a separable metric space (Hzercise 11.8.8}, and Sis a 
non-empty subset of &, then 8 is a separable metric space. 

(ii) In particular, if σὺ is a separable Hilbert space, and 2 is a 
closed linear subspace of 50, then 9t is a separable Hilbert space (see 
Exercises I1.8.8,4, and 6). Using an orthonormal basis for 9, deduce 
another proof of 50 = 2 (Ὁ 951-. 


*7. In Exercise 11.8.5, it is proved that a classical Hilbert space 3C 
is a separable metric space; in the proof sketched there, use is made of 
an orthonormal basis, and hence of the Gram-Schmidt procedure. 
Another proof runs as follows. Let x, be any total sequence in 3¢, and 
let 8 be the set of all finite linear combinations of the z, with Gaussian- 
rational coefficients. Then, 8 is a closed linear subspace, having orthog- 
onal complement {@},s0 8 = 50 results from Theorem 2. Since 8 can be 
enumerated in a sequence, 8 is a dense sequence in 3. 
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87, MAPPINGS 


Suppose 9 is a closed linear subspace of a Hilbert space 3¢. Given 
any vector xz ΕΞ δῦ, there is a decomposition x = ἢ +- z with y € ὅ1 
and z Ε 2* (Theorem 6.2); moreover, the vectors y and z are uniquely 
determined by x (see Theorem 2.3). Thus, one passes from the vector 
x € δ to the vector y € MN in a perfectly definite way. One may say: 
ἐμ depends on 2,” “given any x there is determined a corresponding 
y,” and so on. This is the familiar language of functional dependence. 
The terminology and notation of functions, to be used in the sequel, 
will now be described. 


1. Let % and Y be sets, both non-empty. One says that “Τ' is a 
mapping of X into Y” in case: for each x ΕΞ δῦ, there is determined 
one and only one element y ΕΞ Y, denoted y = Tx for y = T(z)], 
called the value of T at 2. The symbol 7: X — Y means that 7' 
is a mapping of 80 into Yy. One also speaks of “the mapping x — Τὰ 
(x € %).”? Synonyms for “mapping”: function, transformation. 


2. If ΤΊ κ᾽ — Y is a mapping, & is called the initial set, or domain 
of definition of Τ᾽; Y is called the jinal set of T. 
3. Mappings S:X% — Yand 7: ἃ — αἱ are said to be equal in case 


Sx = Tx for all  € X; that is, S and T are “pointwise equal.” 
Notation: S = T. 


4 If T: ἃ -- Y, and 8 is a subset of x, 7'(S) denotes the set of all 
Tx € Y as z varies over 8. Thus, 


T(S) = ty € ἅ{: y = Tz for some αὶ € 8) 
= {72: α ΕΞ 8). 
In particular, Τ (30) is called the range of Τ'. 


5. A mapping Τ: X — Y is said to be injective if it takes distinct 
values at distinct elements of ἃ, That is, the relations x, Ε &, 
%€ XL, 2, γέ te imply Tx, τέ Tx. Equivalently: Tz, = Tz, implies 
<, = rg. An injective mapping is said to be one-to-one. 


6. A mapping 7: 30 — ¥ is said to be surjective if Τα) = Yy. That 


is, given any y € ¥Y, there exists at least one ΕΞ %Xsuch that Tx = y. 
A surjective mapping 7: 50 -- ¥ is said to be a mapping of {% onto Y. 
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7. A mapping Τ': 8 — ἃ is said to be bijective if it is both injective 
and surjective. This means: given any ἢ € Y, there is exacily one 
zx ΕΞ X such that Τὰ = y. Thus, a bijective mapping is a one-to-one 
mapping of X onto Y; such a mapping is also called a one-to-one 
correspondence. 


Examples 


1. Let ® be the set of all real numbers, © = {tx ΕΞ δι: αὶ > —1}, 
and Y¥ = {x ( ®: x > 0}. The mapping 7: xX — Y defined by 
Tx = x + 2 1s injective, but not surjective. 


2. X and Y as in Example 1. The mapping S: X — Y defined by 
Sx = x” is surjective, but not injective. 


3. X and Y as in Lxample 1. The mapping U: X — Y defined by 
Ux = x + 1 is bijective. 


§ 8. PROJECTION 


Definition 1. Let St be a closed linear subspace of a Hilbert space XK. 
Given a vector x ΕἸ HX, suppose x = y + z is the unique decomposition 
with y & N and z € σι (see Theorem 6.2); the vector y is called the 
orthogonal projection of x on I. 


Theorem 1. Lei 9t be a closed linear subspace of a Hilbert space 3. For 
each x ΕΞ &, denote by Px the orthogonal projection of x on MN. Then, 
the mapping P: 3 — σὺ has the following properties: 


(1) (Pay|a2) = (@%|Px2) for all 21,42 € KR. 
(2) Py=yforally € 0. 
(3) Pz=6forallz€ σι". 
Moreover: 
(4) P(x + 2%) = Pa + Paz 
(5) Pz) = A(P2) 
(6) P(Pz) = Pz 
(7) (Px|x) = || Px |? < || =|? 
(8) 9 2s the range of P. 
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(9) N= {xe δ: Pr = 2} 
(10) 91τ = {2 € 50: Px = 9}. 
Proof, 
(1): Let δὶ = y; + 2 and x = ye + 2.5 be the orthogonal decom- 
positions of 2, and 22 relative to 91; that is, y,€ % and χε € m+. 
Then, Pa, = γι, Pa = yo, and (Px;|22) = (y1|y2 + 22) = (yi|y2) 


+ (νι [22) = (Yily2), while (2|Px2) = (ψι + 2 |y2) = (| y2) + 
(21|Y2) = (Yi | 5). ; 


(2): If y€ 91, the unique orthogonal decomposition of y is y = 
y +4. 

(3): If 2 € a+, its orthogonal decomposition is z = 6 + z. 

(4): With notation as in (1), 2 + a2 = (ψι + ye) + (a + 2); 


since y+yeCN and y~+2E RA, Play +2) =y+ y= 
Px, + P2o. 


(5): If x = y +z is the orthogonal decomposition of x, then Ax = 
Ay + Az is the orthogonal decomposition of Az, hence P(Ar) = Ay = 
(Pz). 


(6): For any z € HX, Px € N by Definition 1, hence P(Px) = Px 
by (2). 

(7): With notation as in (5), || x ||? = || y |? + ||z |? > lly |? = 
= also, (Px|z) = yly+2 = (Yyly) + (ψ} 2) = lly |? = 

til. 


(8): If  € P(3), then x € MN by the definition of P. Conversely if 
xE€E MR, then αὶ = Px € P(3K) by (2). 


(9), (10): obvious from the definition of P. ἢ 
Definition 2. The mapping P described in Theorem 1 is called the pro- 
lection of δ on N. The notation Py, is used to indicate the relationship 
of P to the closed linear subspace Dt. 
Exercises 


1. Let 91 be a closed linear subspace of a Hilbert space 20, and sup- 
pose 7’: H — σὺ is a mapping satisfying the following conditions: 
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(1’) (Τα. 2) = (a | Tre) for all 21,22 € 3; (2') Ty = y when y € 91; 
and (3’) Tz = 6 when z € M+. Then T = Py. 


2. Notation as in Theorem 1. Show: 

(i) P(x — y) = Px — Py; 

(ii) if tz, — 2, then Pz, — Px. 

3. Notation as in Theorem 1. If x = y + zis the orthogonal decom- 
position of 2 with respect to 0, define Qz = z. Then: 

(i) Px + Qc = zrforallz€ XK; 

(ii) P(Qxr) = Q(Pz) = θ for allz € XK; 

(iii) Q is the projection of 3¢ on 9+. 

4. If mt and % are closed linear subspaces of a Hilbert space JC, 
P = Poy, Q = Py, and M 1 %, then P(Qz) = Q(Pz) = θ for all 
z€& Xx. 

5. If οἵ and & are closed linear subspaces of a Hilbert space 3, 
P = Poy, and Q = Px, then 51: C N if and only if Q(Pz) = Pz for all 
2 € 5¢. In this case, Pw = θ for all ὦ € N~, and P(Qz) = Pz for all 
ze δ. 


Chapter 


Continuous Linear Mappings |V 


§ 1. Linear mappings 

§ 2. Isomorphic vector spaces 

ἃ 3. The vector space £(U,W) 

§ 4. Composition of mappings 

§ 5. The algebra £(U) 

§ 6. Continuous mappings 

§ 7. Normed spaces, Banach spaces, continuous linear mappings 
ὃ 8. The normed space £,(&,5) 

§ 9. The normed algebra £,(8), Banach algebras 

ἃ 10. The dual space &’ 


§ 1. LINEAR MAPPINGS 


If 91 is a closed linear subspace of a Hilbert space 50, and P is the 
projection of 3C on 9% (described in Theorem 111.8.1), the mapping 
P: σῷ — δῇ satisfies the conditions P(z + y) = Px + Py and P(Az) 
= \(Px). So to speak, P respects vector addition and multiplication 
by scalars. These conditions make sense for a mapping between any 
two vector spaces, and the mappings which satisfy them are of the 
greatest importance: 


Definition 1. If Ὃ and W are vector spaces, a mapping T: Ὃ — W is 
said to be linear in case 

(i) 7’ is additive: T(z + y) = Tx + Ty, for all xy € Ὁ; 

(ii) T is homogeneous: 7'(Ax) = (Tx) for all x € Ὃ and scalar x. 


Examples 


1. U and Ἃ any two vector spaces, T: 0 — W defined by Tx = 6 
for all z € Ὅ. T is called the zero mapping (or null mapping), and is 
denoted T = 0. 
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2. U any vector space, Τ': U — V defined by Tz = xfor allz Ε Ὁ. 
T is called the identity mapping of V, and is denoted T = J. 


3. U any vector space, μ a fixed scalar, T: VU — Ὁ defined by 
Tx = px for all « ΕΞ VU. The additivity and homogeneity of Τ᾿ come 
out of the axioms for a vector space: u(z-+y) = μὰ + uy, and 
u(az) = (wd)a = (Au)@ = λίμα). Such a linear mapping is called a 
scalar mapping in Ὅ. Clearly 0 and I are scalar mappings. 


4. U the vector space of n-ples (Example I.1.1), T:0 — Ὃ defined 
by Τίλι," . + An) = (0,A2,° - * An). 


5. U the space of n-ples, W the space of m-ples, n > m, and 
T:0 — W defined by Τίλι,"" An) = (Aj,°° * Am). 

6. Vand W asin Example δ, andS:W — V defined by S(Ay,- + +,Am) 
= (A1,° **Am;0,°* -,0). 

7. Ὃ the vector space of polynomial functions on [a,b] (Zxample 
1.1.4), with a < ὃ, and Τ Ὁ — Ὃ defined by Tz = x’ (= derivative 
of x). 


8. Ὃ as in Example 7. Define S: 0 — V as follows. If x € Ὁ, ἰοῦ. 


Sx be the unique polynomial y such that y’ = x on [a,b] and y(a) = 0. 
Thus, 


ἑ 
(Sz)() = f a(ujdu (ast<sb). 
9. U the vector space of finitely non-zero sequences (Example 1.1.6), 
Τ: Ὁ — Udefined by T'(Ai,A2,A3, °° *) = (λω,λᾳ,"" “). 


10. U as in Example 9, 5: 0 — V defined by S(A1,A2,A3,°: j= 
(0,A1,A2,A3,°**)- 

11. Let U be the vector space of n-ples, and suppose 1’,---,n’ is any 
rearrangement of 1,---,n. Let T: 0 — Ὃ be defined by Τίλι," " *,An) 
= (Ay, ae An‘): 


12. Let Ὃ be the vector space of n-ples, W the vector space of 
m-ples, and suppose αγκ are fixed scalars (7 = 1,-*-,m; k = 1,-+>,n). 
Define 


T(M1,°**,An) = ὍΣΩΝ αὐ κὰκ 5» pa mide }. ἢ 
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. By definition, a linear mapping “preserves” sums and scalar mul- 

tiples; it also preserves 6, negatives, differences, and linear combina- 
tions: 
Theorem 1. Jf T: Ὃ — W is any linear mapping, 

(i) Te=6 

(2) T(-—z) = —(T2) 

(3) Ta@—y) =Tx—Ty 


ὦ (Xt we) = DP a(n) 


Proof. 
(4): The proof is by induction on n. The case n = 1 is simply the 
assertion that Τ' is homogeneous. Assuming Τ' boy Ant, | = 


ΣῊΝ λ. {{χμ}), one has r(x! nn) = Τ (= ins .1- Nata) = 


T (1 wre) + TAntn) = a Ἀ 2) + A(T2a) = 


5 λυ Τα). 
(1), (2), and (8) are special cases of (4), since @ = Oz, —x = (—1)z, 
andz—y=I1¢r+(—-lLy. J 


A linear mapping is uniquely determined by its effect on any system 
of generators: 


Theorem 2. Suppose ὃ: 0 — W and T: Ὃ — W are linear mappings, 
τε ie a system of generators for U. If Sx = Tx for all x € 8, then 


Proof. 

Let N = {x € VU: Sz = Tx}; by assumption, 8 C 9ῖ. Clearly θ 6 x; 
fee 91, ν € 9ῖ, and d is scalar, S(xz + y) = Sx + Sy = Τὰ + Ty = 
T(x + y) and S(Az) = A(Sx) = X(Tx) = T (da), hence δῖ is a linear 
Subspace. Since U is the smallest linear subspace containing $ (see 
Theorem 1.5.2), Ὃ = KN. That is, Se = Τὰ ἴον 8111 ΕΞ Ὁ. ἢ 


A useful source of linear mappings is the following: 
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Theorem 3. Let Ὃ be a vector space of finite dimension n, and Ww any 
vector space. If σιν" "δα ts a basis for Ὃ, and y1,°**,Yn are arbitrary 
given vectors in W, there exisis one and only one linear mapping 
T: Ὃ -ὦ W such that Tx, = yx for all k. 
Proof. 

Given any vector x € Ὅ, the problem is to define Tz € W. Suppose 
x = 51" dete; since the coefficients dj, are uniquely determined by ὦ 
(Theorem I.6.1), one can unambiguously define T'x = 2. eye ‘The 
linearity of T is immediate from Theorem 1.4.1, and clearly Tx; = Yk 

If S: Ὁ — W is another linear mapping such that Sz, = ψκ for all 
k, S = T by Theorem 2. ἢ 

If T: 9% — Y is any mapping, and 5 is a subset of Y, the set of all 
χ € X such that Τῷ € 3 is called the inverse image of 3 under Τ, 
and is denoted 1 1(6). Similarly, if $ C 9, the set 7 (8) described in 
87 of Chapter III is called the direct image of ὃ under 7’. 
Definition 2. Jf ΤΙ 0 — W is a linear mapping, the set N of all 
vectors « Εἰ Ὃ such that Tx = θ is called the null space of T. Thus, 


R= [ ( Ὅ: Tr = 0} = T*({0}). 
Under a linear mapping, the direct and inverse images of linear 
subspaces are themselves linear subspaces: 
Theorem 4. Let T: 0 — W be a linear mapping: 
(1) If VU, is a linear subspace of 0, Τ (Ὁ) is a linear subspace of W. 
In parti , the range T (0) of T is a linear subspace of W. 
(2) If W, is a linear subspace of W, T—1(9,) is a linear subspace of 
"©, In particular, the null space δῖ = T~'({0}) of T is a linear 
subspace of Ὁ. 
(3) Tx, = Tx tf and only tf τὶ — %2 Ε. ῖ. 
(4) T is injective if and only if Su = [0]. 
Proof. 

(1): Since θ € Up, 6 = TOC T(U,). Suppose ¥1,y2 ΕΞ Τ(Ὁ,), and A 
is scalar. Say yer = Tze, where ἂρ ΕΞ Vo. Since 2 + 2 © Vs and 
Ax, Εἰ Vp, one has yy + yo = Tx, + Tre = Τίσι + %2) € To) and 
λυ! = MTxr) = TAM) € TV). 
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(2): Since Τθ = 0 € Wo, 0 € Το (92). Suppose 2,22 € T(W,), 
and ἃ is scalar. Since W, contains Tx, and T'zo, it contains Ta, + 
T'Xo = T(x, a Xo) and λ( 11) ΞΞ T (Az), hence 1 a to and Ad, be- 
long to Τὼ. 


(3): The following relations imply one another: Tx; = Tze, Tx, — 
Τὰς = 0, T(x, — χ2) = 0, τὶ — m2 CN. 


(4): Suppose Τ' is injective; if  € N, then Tx = θ = Τθ, hence 
xz = 6. Suppose, conversely, that 91 = {0}; if Tx, = Tro, then x, — 
tq ΕΞ MN, hence x, —2%2=86. Ε 


Suppose Φ is any pre-Hilbert space, and y is a fixed vector in @. 
Defining Τὰ = (x|y), one obtains a linear mapping 7: Φ — 6. On 
the other hand, the mapping S: ® — 6 defined by Sx = (y/z) satis- 
fies the conditions S(a, + x2) = Sa, + Saxe and S(Ar) = \*(Sz). This 
is an example of another important type of mapping between vector 
spaces: 


Definition 3. [f U and W are vector spaces, a mapping S: Ὃ - W is 
said to be conjugate-linear (or ‘“‘semilinear’’) in case 


(i) S ts additive: S(z + y) = Sx + Sy 
(ii) S ἐξ conjugate-homogeneous: S(\x) = A*(Sz). 


Several important examples of conjugate-linear mappings will occur 
in the sequel, notably in § 10. 


Exercises 


1. If Τ: Ὁ — W is merely additive (and not necessarily homo- 
geneous), formulas (1), (2), (3) of Theorem 1 still hold. 


2. If 7: ¢* — @” is any linear mapping, there exist scalars a, in 
terms of which 7 can be expressed as in Example 12. 


3. In Examples 1 through 12, discuss null space and range. Which 
of these mappings is injective (resp. surjective, resp. bijective)? 


4. Let T: 0 — W be a linear mapping, 8 a system of generators 
for U, and 3 an independent subset of Ὅ. Then: 

(i) If Τ' is surjective, 7'(S) is a system of generators for W. 

(ii) If 7 is injective, Τ (5) is an independent subset of W. 
If 8 is an arbitrary subset of Ὁ, [Τ (8}] = T({S]) (see Definition I.5.3). 


᾿ 
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5. Let 7:0 — W be a linear mapping. 

(i) If T is surjective, and Ὃ is finite-dimensional, then ‘W is finite- 
dimensional. 

(Gi) If T is injective, and V is infinite-dimensional, then W is 
infinite-dimensional. 

(i) If T is surjective, and ‘W is infinite-dimensional, then U is 
infinite-dimensional. 

(ii’) If T is injective, and ‘W is finite-dimensional, then V is finite- 
dimensional. 


6. Suppose Ὃ has dimension 7, ‘W has dimension m, and 7:0 — ‘W 
is a linear mapping. Given a basis ιν" - - nm for Ὅ, and a basis y;,-* * Ym 
for W, one has Tx, = Ἔρος a;,y; for suitable scalars αμκ(} = 1,-++,m; 
k = 1,--+,n). The coefficients (αι) can be arranged in a rectangular 
array, of m rows and n columns, with aj, occurring as the inter- 
section of the j’th row and k’th column. This array is called the 
matrix of T with respect to the given bases. If S: Ὃ — ‘W is another 
linear mapping, with matrix (8;,) relative to the same given bases, 
then S = T if and only if 8j, = αῃκ for all j and k. 


*7. It can be shown that every vector space U has a basis 5, in the — 
sense of Definition I.7.2. Granted this result, if W is another vector . 


space and 7:3 — W is any mapping, show that there exists one and 
only one linear mapping S: 0 — W such that Sz = Tx for all « € 3. 


8. The obvious analogs of Theorems 1 through 4 hold for conjugate- 
linear mappings. 
§ 2, ISOMORPHIC VECTOR SPACES 
Definition 1. A veclor space Ὃ is said lo be isomorphic with a vector 
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2. If μ ~ 0, the scalar mapping Tz = uz is a vector space isomor- 
phism of 0 onto U. For, T'(u~'y) = y shows that T is surjective, and 
T is injective by Theorem I.2.1. 


3. The mapping Τ(λι," - .,.λ}) = (Ay,*++,An’) described in Example 
1.11 1s a vector space isomorphism. | 


If 7: X — Y is any bijective mapping, one can produce, in a 
natural way, a mapping S: Y — & in the “reverse direction.” The 
mapping 4, called the inverse of 7’, is defined as follows: given y € Y; 
since 7’ is bijective, there is exactly one « Ε & such that Τὰ = y, 
hence one can unambiguously define Sy = x. Thus, Sy is the unique 
element of ὃ. such that T(Sy) = y. If x € x, S(Tx) = 2; for, setting 
y = Tx, Tx = y shows that Sy = ἃ. | 


Definition 2. /f T: X — Y is a bijective mapping, the inverse of T is 
denoted T~'. Thus: 


Gi) Το: > &. 
Gi) Τ {ΤΙ} = 2 [ον αἱ! « Ε &. 
(iii) ΤΟ ἢν) = y for ally € Ὁ. 


It is easy to see that if Τ᾽: 55 — Y is bijective, then 7~ is also bi- 
jective, and (T~") = T. 


Theorem 1. Jf 7: Ὃ — W isa vector space isomorphism, then 
T*: W — Ὃ is also a vector space isomorphism. 
Proof. 

Since S = 7 is bijective, the problem is to show that S is additive 
and homogeneous. Given 41,72 ΕΞ ‘W, and ἃ scalar. One has 
ΤΙδίψι + y2)] = γι + ye = Τίϑψι) + T(Sy2) = Τίϑψι + Sye); since 


space Ἅ) in case there exists a bijective linear mapping T: Ὃ — W. 
Such a mapping T is called a vector space isomorphism of Ὃ onlo “Ὁ. 
The symbol Ὃ & W denotes that Ὃ is isomorphic with W. 


T is injective, S(y; + yo) = Sy: + Syo. Also, T[S(Ay)] = Ay = 
A(T (Sy)] = T{A(Sy)], hence Say) = A(Sy). ἢ 


Example 4. Let Ὃ be the vector space of polynomial functions on [a,b], 
T the linear mapping Tx = 2’ (Example 1.7), and S the linear map- 
ping described in Lxample 1.8. If y is any polynomial, T(Sy) = y 
shows that 7᾽ is surjective. However, T is not injective, for its null 
Space 2 is the set of all constant polynomials. The mapping S is 
Injective; for, if Sy = 0, then y = T(Sy) = TO = 0. However, S is 
hot surjective, since its range is the set of all polynomials which are 
divisible by ¢ — a (recall that (Sy)(a) = 0). 
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1. If Ὁ and Ψ have the same finite dimension n, then VU is isomor- 
phic with ‘Ww. For, suppose 21,---,2n is a basis for D, and y1,--+,Yn 18a 
basis for W. By Theorem 1.3 there exists a linear mapping 7: 0 — 'W 
such that Ta, = yp for all k. It is easy to see that 7’ is bijective 
(see also Exercise 1.4). 


-- = 


πα 


“Ὁ “αὶ 


Se 


ee - — σαι. πὶ 


“ὦ. τὰ 9 aes 


_ — — -- - - » 
Εν... ee 5 ἀ΄ἀο 


84 Introduction to Hilbert Space Iv §3 


Exercises 


1. Let T: Ὁ — W be a linear mapping, 3 a basis for 0. Then, T is 


an isomorphism if and only if 7'(5) is a basis for W. 


2. If UV has finite dimension n, then U = W if and only if W has 
finite dimension n. 

3. If UV is the vector space of n-ples, and ‘W is the vector space of 
polynomial functions of degree <n — 1 defined on the interval [a,}], 
a <b, then Ὁ > W. 


4. The vector space of finitely non-zero sequences is isomorphic 
with the vector space of all polynomial functions on [a,b], a < ὃ. 


δ. If Ὁ is a finite-dimensional vector space, and 7: Ὃ — Visa 
linear mapping, the following conditions are equivalent: 

(a) Tis bijective 

(Ὁ) T is injective 

(c) T is surjective. 

6. Suppose 7: xX — YandS:Y — ἃ. 

(i) If S(7x) = x for all z € &, then T is injective. 

(ii) If T(Sy) = y for all y CY, then T is surjective. 


7. If 7:2 — Yis bijective, and ὃ ΓΞ ¥, the two possible interpreta- 
tions of the symbol 7'~'(S) are consistent; that is, the inverse image 
of $ under T coincides with the direct image of 8 under 7". 


8. Notation as in Examples 1.9 and 1.10. Discuss Τ (85), S(Tx) — 


injectivity, surjectivity, bijectivity. 


9. Let ΤΊ X — YY, and suppose there exist mappings ὦ: y — 5 
and S: ἃ, — % such-that S(Tx) = =z for all ᾧ ΕΞ ἃ, and T(Ry) = y 
for all y © Y. Then, 7’ is bijective, and R = S = 7". 


10. Suppose 7’: X& — Y is bijective. 
(i) IfS:y — 80, and S(Tx) = z for all x € &, then Καὶ = 7. 
(ii) If ΚΕ: ἃ — &, and T(Ry) = y for all y € Y, then R = τ, 


83. THE VECTOR SPACE (Ὁ, Ὁ) 


Definition 1. Jf Ὁ and ‘W are vector spaces, £(U,W) denotes the set of all 
linear mappings T: 0 — Ὁ. 
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Thus, the statement “Τ' € £(U,W)” means that T is a linear map- 
ping of Ὃ into W. If S, 7 € £(0,W), S = T means that Sz = Tx for 
allz & V. 

It will be shown in this section that £(U,W) is itself a vector space, 
with appropriate definitions of S + 7 and XT. The necessary defini- 
tions are as follows: 


1, The zero mapping belongs to £(0U,W); that is, 0 € £(V,W), 
where Oz = @ for allz ΕΞ Ὁ. 


2. If 8,T € £(0,W), define S+ 7: Ὁ - W by the formula 
(S+ T)x = Sx + Tz. If z,y € Ὁ and d is sealar, (S + T)(x + y) = 
Sia@+y) + Tie@t+y) = Sx + Sy+ Tx + Ty = Se + Tz) + 
(Sy + Ty) = (ὃ + Thx + (8 + Τὴν, and (S + T)(z) = S(d2) + 
T(z) = (Sz) + A(Tz) = AGe + Tr) = A(S + T)z]; thus, 
S+T€ £(0,W). 


3. ΠῚ Ε £(0,W), and d is sealar, define AT’: 0 — ‘W by the for- 
mula (AT')z = (Tz). If zy € Ὃ and μ is sealar, (AT)(x + y) = 
AT (x + y)] = (Tz + Ty) = (Tz) + MTy) = (ΤῊ: + (AT)y, and 
(AT) (ux) = A[T(ux)] = Au(Tx)] = (u)(Tx) = (μλ)( 2) = μιλ(7.)} 
= p[{(AT)a]; thus, AT € £(0,W). 


4. fT € £(0,W), define —T: 0 — W by the formula (--- ΤῊΣ = 
— (Tz) = (—1)(Tz). Thus, —T = (—1)T € £(0,W). 


Theorem 1. Jf Ὃ and W are vector spaces, the set £(V,W) of all linear 
mappings T': Ὃ — W ts a vector space, with respect to the following 
definitions: δὰ wcll 


(—T)x = —(Tzx) 
(S + T)z = Sx + Τὰ 


(AT )a2 = X(T2). 
Proof. 

The problem is to verify the axioms for a vector space, listed in § 1 
of Chapter I. For example, the proof of (A1) for £(U,W) depends, in 
ἃ perfectly direct way, on its validity in W. Thus, suppose 
8,T € £(0,W); given any σεῦ, (8 1). =Se+T7Txr = Tr + 
δὰ = (1 + S)z, hence S + T = T+. The verification of the re- 
maining axioms is left to the reader. ἢ 


Example 1. If μι is a scalar, and 7: Ὃ — V is the scalar mapping 
Tx = μα, then T = ul, where 1 is the identity mapping. 
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Exercises 
1, Let 3 be any non-empty set, W any vector space, and let (5, W) 
be the set of all mappings 7:3 — W. If τον ΕΞ 2(5,%), and ἃ is scalar, 
oe (e+) - τῷ Ἐνῶ 
(λυ)(ἢ = λυ(ἢ 


for all ἐ © 3. Then, ϑ0(σ,.Ψ)}) is a vector space relative to the operations 
x + y, Ax (obvious definitions of 0 and —z). 


2, A linear subspace of the vector space %(5,W) described in Ever- 
cise 1 is obtained by considering the set of all mappings 7:5 -- W 
such that τί) = θ for all but finitely many ἐ. 


3. Suppose Ὃ has finite dimension n, and W has finite dimension 
m. Then £(V,W) has finite dimension mn. 


4, Notation as in Exercise 3. Let S,T ΕΞ £(0,W), ἃ scalar. Given 
bases for 0 and ‘W, describe the matrices of S + T and XT in terms 
of the matrices of S and T' (see Exercise 1.6). 


5. If 0,W are finite-dimensional, show that £(U,) is isomorphic | 


with ον, Ὁ). 


6. Notation as in Exercise 1, and assume W τέ {6}. Suppose 
(5,) has finite dimension N. Then, 3 has a finite number n of 
points, W has finite dimension m, and N = mn. 


§ 4. COMPOSITION OF MAPPINGS 


Given mappings T: 80 — Yy and S: y — Z, there is a natural way 
of defining a mapping of & into Z: if x € αὐ, then Tr € Y, hence 
S(Txz) € Z. The mapping z — S(Tz) is called the composite of S and 
Τ᾽, and is denoted ST’. Thus, 


ST:X% - Z 
(57). = S(Tx) (x € &). 


ey Z 
NAS 


Schematically, 
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To define the composite ST’, it is required that the initial set of S 
coincide with the final set of 7’. It follows that if one is given mappings 
T: Ὁ — Y, S:Y — Z, and Rk: Z— 35, all of the following composites 
are definable: RS, ST, (RS)T, and R(ST). Such composites obey the 
“associative law’: 


Theorem 1. ΠῚ ΤΊ ἃ — Y,S:Y — Z, and R: Z — 3, then R(ST) = 
(RS)T. 


Proof. 
For all « € 85, [R(ST)]z = R[(ST)z] = RIS(Tx)] and [(RS)T]z = 
(RS)(Tx) = R{S(Tx)}. ἢ 


Example 1. If 7: % — Y is bijective, and S = T~', the identities 
S(Tx) = «and T(Sy) = y can be written ST = J and TS = I. 


Theorem 2. Jf U,U,W are vector spaces, and T:U - Ὁ, 8S: 0 > W 
are linear mappings, the composite ST: U — W is also linear. 


Proof. 

If zy CU and X is sealar, ST)@+y) = STae+y)= 
S(Tx + Ty) = S(Tx) + S(Ty) = (ST)z + (ST)y, and (ST)(Az) = 
S[TAx)] = S[A(T2)] = AS(T2)] = AST)z]. ἢ 


Exercises 


1. Given T:X% - Y, 8S: Y — Z. 

(i) IfS and T are both injective, then ST is injective. 

(1) If ST is injective, then 7’ is injective. 

(iii) If ST is injective, and T is surjective, then S is injective. 

(iv) Give an example where ST is injective, but S is not injective. 


2. Given T:X%& — Y, δ Y — Z. 

(i) IfS and T are both surjective, then ST is surjective. 

(ii) Ii ST is surjective, then S is surjective. 

(iii) If ST is surjective, and S is injective, then T' is surjective. 
(iv) Give an example where ST is surjective, but 7’ is not surjective. 


3. Given T:X% - ὁ ἃ, δ: ἃ — Ζ. 

(i) IfS and Τ' are both bijective, then Θ᾽ is bijective. 

(ii) If ST is bijective, then S is surjective and 7᾽ is injective. 

(iii) If ST’ is bijective, and either S is injective, or Τ' is surjective, 
then both S and T are bijective. 
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(iv) Give an example where ST is bijective, but one of S,T (hence 
both) fails to be bijective. 


4. Let ΤΊ X — YY. One says that T is left-invertible if there exists a 
mapping S:Yy — δὲ such that ST = J (= identity mapping of 9x); S 
is then called a left-inverse for T. One says that Τ' is right-invertible if 
there exists a mapping #: Y — & such that TR = J (= identity 
mapping of Y); F# is then called a right-inverse for T. Prove: 

(i) YT is injective if and only if T has a left-inverse S. 

*(ii) YT is surjective if and only if 7 has a right-inverse R. 

(iii) YT is bijective if and only if 7 has both a left-inverse S and a 
right inverse R. In this case, R = S = T~', and in particular all 
left-inverses and right-inverses for 7' coincide with the mapping ΤΟ“. 


5. Given T: XY, S: Y - Ὁ Z, Ἰοὺ R= ST. If ὅς Z, then 
ΕΘ}. 


6. Notation as in Theorem 2, with %U,0U,W finite-dimensional. 
Choose bases for U,U,W, and describe the matrix of ST in terms of 
the matrix of S and the matrix of 7. [See Ezercise 1.6.] 


7. Let U,0,W be vector spaces. Prove: 
G) UU, 
(ii) If ULV, then Ὁ =U. 
(11) If US Vand Ὁ = W, then U = W. [See Definition 2.1 for 
the notation. | 
§5. THE ALGEBRA (Ὁ) 


Definition 1. /f Ὃ is a vector space, £2(U) denotes the set of all linear 
mappings T: Ὃ — V. 


By Theorem 3.1, £(0) = ρ(Ὁ,Ὁ) is a vector space. Thus, if 
S,T ΕΞ (Ὁ) and d is sealar, £(V) contains the linear mappings S + Τ' 
and AT’; it also contains the composite ST’, by Theorem 4.2. 


Theorem 1. Jf 2,S,T Ε (Ὁ), and Ἃ ts scalar, 
(1) RST) = (AS)T 
(2) RS+T) = RkS+ RT 
(3) S+M)R=SR+TR 
(4) AQST) = (S)T = SAT). 


i 
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Proof. 

(1): See Theorem 4.1. 

(2): [RS + Τὴ] = RUS + T)a] = R(Se + Tx) = R(Sx) + R(Tx) 
= (δ). + (RT)« = (RS + RT)z. 

(3): [(S + ΤΕ] = GS + T)(Rz) = Sz) + Τα) = SR)t + 
(TR)x = (SR + TR)z. 

(4): [AST)]a = ALST)z] = A[S(T)| 

[AS)T]z = (S)(Tz) = AS(Tz)] 
[SAT)]z = S[AT)z] = S[A(Txz)] = NS(T2)). ἢ 

Thus, £(V) provides an example of the following type of abstract 
mathematical system: 
Definition 2. An algebra zs a vector space @ such that for each pair of 
elements a,b € ἃ, there is determined an element of @ called the product 
of a and b, denoted ab, subject to the following axioms: 

(Al) a(b+c) = ab+ac 

(A2) (6+ c)a = ba+ca 

(A3) A(ab) = (Aa)b = a(Ab). 
If moreover 

(A4) a(be) = (ab)e 
holds for all elements a,b,c, then @ is called an associative algebra. 


Examples 


1. If Ὃ is any vector space, £(U) is an associative algebra (see 
Theorem 8.1 and Theorem 1). 


2. Let ἃ be the vector space of all scalar-valued functions defined 
on a non-empty set 3 (Hxample 1.1.8). If x,y ΕΞ ἃ, define zy by the 
formula (ay)(t) = σ( (ἢ). With this “pointwise” definition of prod- 
ucts, it is easy to see that @ is an associative algebra. Also, ry = yx 
for all x,y ΕΞ ἃ (such algebras are called commutative). 


3. The vector space @ of continuous functions on [a,b] (Zzample 
1.1.8}, with products defined pointwise as in Example 2, is an associa- 
tive and commutative algebra. [Note: The product of continuous 
functions 7s continuous.] 


90 Introduction to Hilbert Space IV §5 


4. The vector space @ of polynomial functions on [a,b] (Example 
J.1.4), with products defined pointwise, is an associative and com- 
mutative algebra. 


5. Notation as in Example 2. Suppose moreover that J = @, the 
set of all scalars ἃ. Then, if x: Θ — Ὁ and y: Ὁ — @ are any two 
elements of @, there is also available a “composite product” of xz and 
y, defined as in § 4; let us denote this xoy, as distinguished from the 
pointwise product zy defined in Example 2. Specifically, (xoy)(t) = 
z(y(t)). With respect to the products τόμ, the vector space ἃ does not 
form an algebra, since the identities (Al) and (3) are not valid for 
composite products; the identities (A2) and (A4) do hold, and A(woy) 
= (Ax) oy. 


Exercises 


1. Let 2;,-++,2, be a basis for the vector space U. For each pair of 
indices j,k (7 = 1,--+,n; k = 1,--+,n), let Hj, be the linear mapping 
such that Ey.a0, = x; and ἔχει = 6 when i + k (see Theorem 1.3). 
Then: 

GQ) Lyk = Lx. 

(a1) EE px = ᾧ when 7 =f. 

Gii) Σ᾿, Bu = I. 


2. Suppose Ὃ is a vector space of finite dimension n. A “set of 
τ X n matrix units” in £(V) is a set of linear mappings Hj, ΕΞ £(V) 
(7 = 1,-++,n; k = 1,-++,), satisfying the relations (i), (ii), (iii) of 
Fzercise 1. Given such a set, prove there exists a basis 21,---,¢, of Ὁ 
such that Ej,.2; = 2; and ἔπεσι = θ when ἢ γέ k. 


3. If @ is an algebra, and ὅ is a non-empty set, the set of all map- 
pings x: 3 — @ is an algebra (with operations defined pointwise, as 
in Hxample 2). 


4, Let @ be any algebra, with products denoted ab. Introduce a 
new product [a,b] between elements a,b € ἃ, via the formula [a,b] = 
ab — ba; together with the vector space operations already given, @ 
is an algebra (generally non-associative) relative to the products [a,b]. 


5. Let @ be any algebra, with products denoted ab. Define a new 
product {a,b} via the formula {a,b} = $(ab + ba). Then @ is an 
algebra (generally non-associative) relative to the products {a,b}. 
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6. Starting with an algebra @, with products ab, let @, be the 
algebra constructed in Exercise 4. If the construction in Exercise 5 is 
applied to the algebra αι, describe the resulting algebra. Similarly, 
start with @, apply Hzercise 5 to get an algebra @o, then apply Exercise 
εἴ Qo. 


§ 6. CONTINUOUS MAPPINGS 


Suppose 2 is a closed linear subspace of a Hilbert space ac, and 
P:# — His the projection of σῦ on N (Definition 11.8.2). liz, > Z, 
then Px, — Px; for, by Theorem 111.8.1, || Pt, — Px|| = 
| P@n — 2) || < || en — x || — 0. So to speak, P “preserves” con- 
vergent sequences. This type of condition makes sense for metric 
spaces: 


Definition 1. Lei X and Y be metric spaces, ΤΊ: % -- Y. Let x € X; the 
mapping T is said to be continuous at x in case t, -- ὦ implies 
Tx, — Tx. If T is continuous at every x ΕΞ %, it is called a continuous 
mapping. 


Examples 


1. If Wis a closed linear subspace of a Hilbert space 50, the projec- 
tion P: σὺ — δῦ of 50 on 91 is a continuous mapping. 
2. If the metric spaces X and Y in Definition 1 are sets of complex 


numbers, with d(A,u) = |A — μ|, the notions of continuity are the 
classical ones, 

9. Let & be a metric space, y a fixed point of x. The mapping 
T: ὃ — ® defined by Tx = d(z,y) is continuous by Theorem J I A, 
For example, if y is a fixed vector in a pre-Hilbert space Φ, then 
«— || x — y || is a continuous mapping of @ into the metric space δὶ of 
real numbers; in particular, z — || z || is continuous (see also the 
Corollary of Theorem 11.8.1}. 


_ 4. If y is a fixed vector in a pre-Hilbert space ®, then « -- (z|y) 
is a continuous mapping of @ into the metric space Ὁ of complex num- 
bers (see Theorem 11.8.1). 


Theorem 1. If X and Ἢ are metric spaces, the following conditions on a 
mapping T: X% — Y are equivalent: 

(a) T is a continuous mapping. 

(Ὁ) Jf Sis any closed subset of Y, T—'(S) is a closed subset of 9. 
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Proof. 


(a) implies (b): Let 8 be a closed subset of Y, and suppose τ € Vis 
adherent to Τ (8). Choose any sequence a, € Τ 1(8) such that 
Ln — x. Then Tx, € 8, and since T'x, — Τὰ by the continuity of Τ᾽, 
Τα is adherent to 8; since 8 is closed, Tx € 8, thus « © T~'(S). 


(b) implies (a): Given x, — x, the problem is to show that 
Tx, — Tx. Assume to the contrary that for some e > 0, d(Tz,,Tx) > 
e for infinitely many n. Passing to a subsequence, we may suppose 
d(Tz,,Tz) > efor all ἢ. Let 8 = {y € Y: d(y,Tx) = εἰ. Τὸ is easy to 
see that 8 is a closed subset of Y (see Hxample III.2.6). Now, 
d(Tx,,Tx) > eshows that Τὰ, € 8, t, ΕΞ T'~'(S), hence x is adherent 
to Τ (8); since Τ᾽ (8) is a closed subset of 9 by the assumption (b), 
z € T (8). Thus, Τῷ € 8, d(Tz,Tx) > ε, a contradiction. J 
Corollary. Jf X and Y are metric spaces, T: X — Y is a continuous 
mapping, and y ΕΞ Y¥, then {x ΕΞ X: Tx = y} ts a closed subset of X. 


Proof. | 
{y} is a closed subset of y. J 


Exercises 


1. Let % and Y be metric spaces, S: X > Y and T:X -Ὁ Y 
continuous mappings. Prove: 

(i) {2 € αὐ: Sx = Tx} is a closed subset of &. 

(ii) If 8 is a dense subset of X, and Sx = Tz for all ᾧ € 8, then 
S = Tf. 


2. If α 4,2 are metric spaces, and 7: X — Y,S:Y — Z are con- 
tinuous mappings, then ST: X — Z is a continuous mapping. 


§7. NORMED SPACES, BANACH SPACES, CONTINUOUS 
LINEAR MAPPINGS 


From Chapter VI onward, we wil! be concerned primarily with con- 
tinuous linear mappings ΤΊ 3¢ — σὺ of a Hilbert space 5 into itself. 
Much of what can be said about such mappings can be expressed 
directly in terms of the norm, and properties of the norm, without 
having to refer explicitly to the scalar product from which the norm 


is derived. A convenient vehicle for discussions of this sort is the © 


concept of a normed space: 
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Definition 1. A normed space is a vector space & such that for each 
vector x € ὃ, there is defined a non-negative real number called the norm 
of x, denoted || x ||, subject to the following axioms: 


(N1) || «|| > O whenever x ¥ θ; || 0 || = 0. 
(N2) |le¢+yllSsizll+]y]. 
(N38) |] Ax |] = JA]- || 2]. 


In words, the real-valued function x — || x |! (x € 8) is strictly 
positive, subadditive, and absolutely homogeneous. It is clear from (N3) 
that || —2 || = || 2 || = || x |. 


Examples 


1. Every pre-Hilbert space is a normed space, with norms defined 
by the formula || « || = (¢|x)” (see Theorems 11.8.1 and 11.8.8). 


2. Let & be the vector space of continuous functions on [a,b] (see 
Example 11.4.8), and define the norm of zx to be the number 
LUB {|2(é)|:a < ¢ < 6}. This norm is denoted || z ||,,, to distinguish 
it from the pre-Hilbert space norm described in Example 11.8.9. 


3. If ὃ is a normed space, and % is a linear subspace of 8, then 9% 
is itself a normed space. ἢ 


Every normed space & is a metric space, with distances defined by 
the formula d(z,y) = || xz — y || (see the proof of Theorem 11.8.6). 
Thus (see §5 of Chapter II), a sequence ἀρ is (i) convergent to a 
vector zx in case || t, — x || — 0, (ii) Cauchy in case || x, — xp || > 0, 
(iii) bounded if there is a constant M such that || zp || < M for all n. 
If «>0, the sets {z: ||x—y|| <«}, {a: [{ -- ψ || Φ εἰ, and 
ix: || 2 — y || = εἰ are called, respectively, the open ball, closed ball, 
and sphere, of center y and radius ε. Spheres and closed balls are closed 
subsets of & (see Example ITT.2.9). 


Theorem 1. Jn any normed space: 
(1) Ifa, — randy, — y, then2n + yn > e+ y. 
(2) If tn — cand ry — ἃ, then Antn — λα. 


(3) If an and yn are Cauchy sequences of vectors, and Xp» is a Cauchy 
sequence of scalars, then Xn + Yn and Antn are Cauchy sequences, 
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(Ὁ {Π|1}} -- ἅν Π] Φ 115 -- ψἹ. 
(5) If tn — «, then || en || — || ||. If an ig Cauchy, then || xn || ts 
Cauchy (hence convergent). 
Proof, 
(1), (2), (3): See the proof of the Lemma to Theorem ITI.3.2, 
(4): See Theorem 11.4.1. 
(5): Immediate from (4). J 


In conformity with Definition 6.1, a mapping T: ὃ — §& between 
normed spaces is continuous at x € ὃ in case || x, — x || — Ὁ implies 
|| Tx, — Τῷ || -- 0. It is a continuous mapping if it is continuous at 
every « € ὃ. 

Theorem 2. Jf & and δ᾽ are normed spaces, and T: ὃ — F is a continu- 
ous linear mapping, the null space Xt of T ἐδ a closed linear subspace of ὃ. 
Proof. 


X is a linear subspace of & by Theorem 1.4. Suppose z, € I and 
In —> x. Since T' is continuous, Tx, — Tx. Thus, Tx = lim Tz, = 


lim @ = 6, hence z € 9. Thus, 90 is a closed subset of & (one could — 


also quote the Corollary of Theorem 6.1). ἢ 


For a linear mapping between normed spaces, continuity has sev- 
eral useful reformulations: 


Theorem 3. Lei & and 5 be normed spaces, T:& — 5 a linear mapping. 
The following conditions on T are equivalent: 


(a) Τ' ts a continuous mapping. 

(Ὁ) T ts continuous at some point x ΕΞ 8. 

(c) T' ts continuous αἱ θ © 8. 

(d) {ΠΤ} : || x || <1} ἐβ @ bounded set of real numbers. 


(e) There exists a constant M > 0 such that || Τὰ || < ἡ || x || for 
allz € &. 


Proof. 


(a) implies (b): trivial. 
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(b) zmplies (c): Suppose a, — 0. By Theorem 1, τη +2 — 0 + to 
= %. By assumption (Ὁ), T(t, +2) — Tx, thus Τὰ, + Τα, — 
TXo, Tt, = (Try - Tx.) — Τὰ, > 0. 


(0) ¢mplies (d): Assume to the contrary that {|| Tz || :|| 2 || <1} is 
unbounded. lor each n = 1,2,3,---, choose a vector z, € ὃ such that 
|| an | $1 and || Tz, || >. Define yn = (1/n)an. Since || ψ, || = 
(1/n) || tn || ΞΖ 1/n, yn — θ. By assumption (c), Ty, — 6. But, 
| Tyn || = (1/n) || Τὰς || > 1, a contradiction. 


(d) implies (6): Assume to the contrary that no such constant M@ 
exists. In particular, the constants 1,2,3,--- fail: for each n, choose 
Zn € ὃ so that || Tz, || > || 2, ||. Clearly x, γέ 0; defining y, = 


|| tn || "an, one has || yp || = 1, but || Typ || > 2, contrary to (d). 
(e) implies (a): If x, — x, then || Tz, — Tz || = || Tan —2) || < 
M ||2, —2x|| > 0, hence Tz, - Tx. ἢ 


In view of condition (d), continuous linear mappings between 
normed spaces are also called bounded linear mappings. 


Definition 2. /f & and F are normed spaces, and Τ': ὃ — § is a con- 
tinuous linear mapping, the non-negative real number 

LUB {|| Τὰ ||: || x || < 1} 
ts called the norm of Τ', and is denoted || T ||. 


Theorem 4. Let ὃ and F be normed spaces, ΤΆ 8 --Ξ 5 a continuous 
linear mapping. Then: 


(1) || T || = LUB {|| Tz | : [| 2 |] < 1}. 
(2) If ὃ # {6}, || T || = LUB {|| Τὰ [|| : || x || = 1}. 
(3) | Tz||< {{1Ὑ || - || c\|foralces. 


(4) If M20 and || Tx|| <M || || for all eee, necessarily 
ITI <u. 


Proof, 

(1): Let Καὶ = LUB {|| Tz || : || x || < 1}; clearly K < || ΤΊ. Sup- 
pose || x || < 1; given any e > 0, the vector y = (|| z || + εὐ τς has 
norm <1, hence || 7'y || - K; that is, || Tz || < K(|| 2] + ὁ. Letting 
€— 0, || Tz|| < K ||2|| < K. Taking LUB over ἢ <1, 7) <K. 
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(2): Since ὃ # {6}, there exist vectors ᾧ € ὃ such that || z |] = 1. 
Define N = LUB {|| Τὰ || : || || = 1}; clearly N < || T ||. Suppose 
| «|| < 1. If a = 0, then Tx = θ, hence || Tz || = 0 < N. If « #8, 
the vector y = || «||~‘w has norm 1, hence [| Ty || < N; that is, 
| Tx || < N || «|| = N. Thus, || 72 | < N whenever || x || < 1; it fol- 
lows that || T || < N. 


(3): Given any x € ὃ. If « = θ, clearly || Τὰ || < || T || - || x ||. If 
a #6, let y = ||2||~*a; since || y || = 1, || Ty|| < || Tl, that is, 
| Tz |] < |] TI - el. 


(4): Suppose M > 0 and || Tz || < M||z|| for all « Ε 8. If 
|z|| <1, | Tx || <M] « || < M; taking LUB over || z || < 1 
Τ| <M. J 


, 


Examples 


4, Let y be a fixed vector in a pre-Hilbert space @, and define 
T: @ — @ by the formula Tx = (x|y). Then, T is a linear mapping 
of ® into the one-dimensional Hilbert space @, and | Tx! = |(x!y)| < 
|| z || || y || shows that 7 is continuous, and || Τ' || < || y |]. In fact, 
| T || = || y ||. This is clear if y = 0; if y #0, || T || || yl] = |Ty| = 
I(yly)| = || ν ||? shows that || T || > | y |). 


5. If P and Q are pre-Hilbert spaces, and 7: ® -- ὦ is a continuous 
linear mapping, then || 7 || = LUB {|(Tz/y)|:|| || < 1, || y|| < 1}. 
For, let Καὶ denote the indicated LUB. If x € 0, y Ε Q, || x || <1, 
ly ll <1, then |(Tz|y)| <I ΤΉ} vl <si Tiel lvl <i 
hence K is a finite real number, and Καὶ < || 7 ||. If |] x || < 1 and || y |j 
<1, || Tx)| = |(Tz|y)| < K; fixing z and taking LUB over y, 
| Tx | < K by Example 4. Taking LUB over g, || 7 || < K. 


6. If σ is a Hilbert space of finite dimension n, and F is any normed 
space, then every linear mapping Τ': 30 — δ is continuous. For, let 


%1,*** tn be an orthonormal basis for 3¢. If « = = ΣῪ Mets is any vector 
in a, 


| Τὰ || = |] Sy (Lax) |] < DOP rel | Tee | 


«(Σ᾿ Ρ) (titer) 
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by the Corollary of Theorem 11.8.4. Thus, || Tx || < M || x ||, where 
Mg 

M = (= | Τὰς ) is independent of x. This shows that T is con- 
tinuous, and || 7 ||< Μ. §f 

The most obviously continuous mappings are those which preserve 
distance: 
Definition 3. If X and Y are metric spaces, a mapping T: X —> Y is 
said to be isometric tn case d(Tx,,Tx2) = d(x ,%2) for all 21,20 € Ὁ, 
Theorem 5. Jf & and & are normed spaces, a linear mapping T:&8 -- δ 
is isometric if and only if || Tx || = || x || for all « € 8. 
Proof. 

This is immediate from the relations || Τὰ — Ty || = || T(« — y) || 
and || Tz || = || Τὰ — Te]. J 

Metric spaces in which every Cauchy sequence is convergent are 
said to be complete. Pre-Hilbert spaces with this property are called 
Hilbert spaces. A normed space with this property is called a Banach 
space: 
Definition 4. A Banach space is a complete normed space. 

Thus, a Banach space is a normed space ® with the following prop- 
erty: if an ΕΞ Band || tz, — xy || — 0, there exists an « ΕΞ ὦ such that 
| πη — «|| > 0. 


Examples 


7. Every Hilbert space is a Banach space. In particular, @ is a one- 
dimensional Banach space. 


8. If ὦ is a Banach space, every closed linear subspace % of ᾧ is 
also a Banach space (see Theorem III.4.1). J 


Further examples of Banach spaces will be found in the exercises. 
The following theorem is a simple and useful application of complete- 
ness: 


Theorem 6. Lei S: 51 — @ be a continuous linear mapping, where & is 
a Banach space, and Nt is a dense linear subspace of a normed space 8. 
Then, there exists one and only one continuous linear mapping T:&8 > ᾧ, 
such that Tx = Sx whenever α © MN. One has || T || = || S ||. 
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Proof. 
Given x € ὃ, the problem is to define Tx € ἃ. Choose any sequence 
Yn ΕΞ MN such that yn — x. Since || Sym — Syn || = || S@m — Yn) || S 


15 |] |] Ym — Yn || — 0, Syn is a Cauchy sequence in @. Since ὦ is 
complete, Sy, is convergent. If also 2, ΕΞ 9ῖ and 2, — 2, S2zp is 
also convergent, and since || Sy, — Szn || = || ϑίψη — Zn) || < || 5} 
lyn -- ὅν || < |S ll (ll yn — @ |] + || z — 2n I) — 0, one has lim Sy, = 
lim Sz,. Thus, one can unambiguously define ΤῺ = lim Syp. Since 
| Syn |] < Sl ll yn ||, letting πὶ — © one has || Tx || < || S|} || x]. 

The linearity of 7: & — @ is easily deduced from the linearity of 
S, using Theorem 1. The above calculation then shows that 7’ is con- 
tinuous, and || T || < || S ||. Since Ty = Sy when y ΕΞ 91, it is clear 
that || T || > || S |]. 

The proof of uniqueness is left to the reader. See for instance 
Exercise 6.1. J 


Exercises 


1. If 91 is a linear subspace of a normed space ὃ, δὲ is a closed linear 
subspace of &. 

2. If & and § are normed spaces, Τὶ & — § is a continuous linear 
mapping, and z, is a Cauchy sequence in ὃ, then T'z, is a Cauchy 
sequence in §. Does the analogous result hold for a continuous map- 
ping between metric spaces? 

3. If & is a normed space, @ is a Banach space, and 7':&§ — @isa 
vector space isomorphism such that both 7’ and 7'~ are continuous, 
then & is a Banach space. 

4. If ὃ and § are normed spaces, 7': ὃ — § is linear, Καὶ > 0, and 
either {|| Τὰ ||: {ὦ || < K} or {|| Tx || :|] 2 || = A} is bounded, then T 
is continuous. 

5. If & is a normed space, and ὃ is a pre-Hilbert space, a linear 
mapping Τ ὃ — Ὁ is continuous if and only if the set 


{{(Ἰ||Ψ}}: {{π}} Φ 1,ν}} si 
is bounded; {πὸ LUB of this set is then = || T ||. 


6. Let & be the vector space of n-ples = (Aj,°++,An), and define 
"Πα 1 = D0) | de]. The result is a Banach space. 
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7. Let & be the vector space of continuous functions x on [a,b], 
where a < ὃ, and define 


[21 -- [ἸσΩ] αι. 


Then, & is a normed space, but is not a Banach space. 


8. Let 3 be a non-empty set. A scalar-valued function x defined on 
3 is said to be bounded if there exists a constant K such that |2()| < 
K for all t € 3. Denote by ὃ the set of all such 2, and define 


@+y0 =2O + yO 
(Az) (f) = Ax) 
|| # ἴω = LUB {|x@|:¢ Ε 35}. 
Then, & is a Banach space. 


9. Let & be the vector space of continuous functions on [a,b], 
normed as in Hxample 2. Then, & is a Banach space. 

10. (i) If & is any normed space, the identity linear mapping 
I: ὃ — & is continuous. If ὃ + {6}, || Z |] = 1. 

(ii) If & and ¥ are normed spaces, and 7’: ὃ — § is a vector space 
isomorphism such that both 7 and 7'~ are continuous, then || 7’~ || 
=e: 

11. Why does the argument in Example 6 fail for infinite-dimen- 
sional 50) Why does it fail if 5 is replaced by a finite-dimensional 
normed space & (even though 7’ is nevertheless continuous by /zer- 
cise 9.1 below)? 

*12. If Fis a normed space of finite dimension n, @” is the n-dimen- 
sional unitary space, and 7’: C" -- § is any vector space isomorphism, 
then both 7 and T~ are continuous. 

*13. Every finite-dimensional normed space δ is a Banach space. 


*14. Every finite-dimensional linear subspace of a normed space is 
closed. 

15. Let & be the set of all sequences + = (Ax) of scalars which are 
absolutely summable, that is, 0° |\.| < οὐ. With equality, sums, and 
scalar multiples defined as in Example 1.1.6, and with || z ||; = 
ΠΑ, 8. is a Banach space. 
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16. Let &),&,83,--* be a sequence of normed spaces, ὃ the set οἵ 
all “sequences” 2 = (αι), with x, © &, such that Σ ἢ | 2% |] < ὦ. 


With the obvious definitions (see Exercise 15), & is a normed space. & 
is a Banach space if and only if every & is a Banach space. 


17. Let &1,82,83,-** be a sequence of normed spaces, & the set of 
all “sequences” x = (2,), with 2 € &, such that D0? || xx ||? < ©. 


Define » 
ΓΗ͂Ν (x: | x I) | 


and define x + y, Az in the obvious way. Then, ὃ is a normed space. 
& is a Banach space if and only if every &; is a Banach space. 


*18. Let & be the vector space of m-ples « = (A,"**,An), p 8 fixed 
l/p 

real number >1, and define || x ||, = (Σιν ?) . Then, & is a 

Banach space. 


*19. Let & be the vector space of continuous functions z on [a,b], 
where a < ὃ, and let Ὁ be a fixed real number >1. Define 


| x | 1 =( Ἢ | x(t) |? a)” 


Then, & is a normed space, but is not a Banach space. 

20. Let & be the vector space of n-ples x = (Aj,***,An); With 
|| x [1 = max {!d,!,-+-,|An|}, & is a Banach space (see Exercise 8). 
If || x ||, is defined as in Ezercise 18, 


"Πα lle = si |  llp- 


*21. Suppose ὃ is a normed space such that || a + y ||? + || « — y ||? 
= 2|| 2/7 +2 || y ||? for all x,y € 8. Then, there exists a scalar prod- 
uct (x| y) on &, satisfying the axioms for a pre-Hilbert space, such that 
|| 2 || = (|x). Thus, a Hilbert space can be described as a Banach 
space whose norm satisfies the “parallelogram law.” 


ἢ 8. THE NORMED SPACE &,(&,5) 


Definition 1. [f ὃ and δ᾽ are normed spaces, &.(&,5) denotes the set of 
all continuous linear mappings T:& — &. 
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Theorem 1. If & and & are normed spaces, &,(&,) is a linear subspace 
of (8,5), and is a normed space with || T || = LUB {|| Tz ||: || x |] <1}. 


Proof. 

See Theorem 3.1 for the definition of the vector space (8,5). 

Clearly 0 € £,(6,%), and ||0|| = 0. If S,T € £.(8,5), and Δ is 
scalar, then for any || x || < 1, || (80+ T)a || = || Se + Tz || < || Sx || 
+ || Tel) < S| +] TI, andl] @7)x | = ||acrx) ἢ = |al- {{ Te I 
< |A|- || 7 |; thus S + T and AT are continuous, and taking LUB 
over ||z|| <1, one has ||S+T|| « || 8}} -Ε ΠΤ] and |aT|= 
[A] + |] 7 I]. Clearly |T || >OunlessT=0. J 


Theorem 2. If & is a normed space and ® is a Banach space, then 
£-(&,8) is a Banach space. 


Proof, 

Given a sequence of continuous linear mappings 7',: 8 — ὦ such 
that || 7, — 7, || — 0, the problem is to construct a continuous 
linear mapping 7': ὃ — @ such that || T — Τὶ || - 0. 

For each  Ε- &, || Tax — Tax }| = || (Pn — ΤΙ) || < || Tn — Ta ἢ} 
|| x -ὸ Ὁ shows that 7,2 is a Cauchy sequence in @®. Since @ is 
complete, Τ᾽, converges to a unique limit, which we denote Tx. Thus, 
the mapping Τὶ & — @ is defined by the formula ΤῊ = lim 7,2. It 
will be shown that (1) 7’ is linear, (ii) T is continuous, and (iii) 
| F — Fall + 6. 

By Theorem 7.1, T(x + y) = lim Τ, (ὦ + y) = lim (T,2 + Thy) = 
lim Τα + lim ΤῊΝ = Tx + Ty, and T(Az) = lim T,,(Az) = lim X(T,2) 
= A lim 7,2 = (Tx). Thus, T is linear. 

Since every Cauchy sequence is bounded (see the Lemma of 
Theorem 11.5.1), there is a constant K > 0 such that || 7, || < Καὶ for 
all n. Given any x € &; since 7,4 — Tx, || Tnx || -- || Tx || by 
Theorem 7.1, Passing to the limit in the inequality || T,2 || < K || « |], 
one has || 7. || < Καὶ || a ||. Thus, Τ is continuous, and || 7 || < K. It is 
now meaningful to write || 7 — 7, ||. 

Given e > 0, let N be an index such that || T,, — Τὶ || < ¢ when- 
ever m,n > N. Given z € δ, || 2 || < 1. If mn ἘΝ, || Tax — Taz | 

= || (Tm — ΤΩ} < || Tm — Tn | [1] < elle | « «fixing n > 
and letting m — ©, one has || Tz — ΤΊ || < «. Thus, | Tx — T,2 || 
S ¢ provided n > N and || a || < 1; fixing n > N and taking LUB 
over || x || < 1, || T — Τ᾿ || < ε. Summarizing: given any ¢ > 0, there 
1s an index N such that || 7' -- 7, || < ¢ whenevern >N. } 
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Exercises 


1. (i) If & and & are normed spaces such that £,(&,5) is complete, 
δ is not necessarily complete. 
(ii) If © isa pre-Hilbert space, Φ ~ {0}, and £,(0,) is com- 
plete, then ¥ is necessarily complete. 
*(iii) If § ~ {6} and £,(&,%) is complete, it can be shown that F 
is necessarily complete. 


2. Let ὃ and § be normed spaces, and 7',, Τ ΕΞ £-(&,5). One says 
that “Τ', — T uniformly” in case || T, — T || — 0, thatis, T, —- T 
in the normed space £,(&,5). One says that “T', — T strongly’’ in case 
7,2 — Tx for each σε δ, that is, || 7,2 — Tx || — 0 for each 
x € &. Prove: 

(i) If T, — T uniformly, then T, — T strongly. 

(ii) T, — T uniformly, if and only if: given any e > 0, there is an 
index N such that || 7,2 — Tx || < « whenever n > N and || z|| <1 
(so to speak, 7',2 — ΤῊ uniformly for z in the closed unit ball of 8). 


3. Let & and § be normed spaces, Το: ὃ — ¥ a sequence of con- 
tinuous linear mappings, and assume there is a constant K such that 
| Tn || < Καὶ for all n. 

(i) Let σ᾽ = {x € δ: Τα is Cauchy}. Then, 91 is a closed linear 
subspace of &. 

(ii) If ¥ is a Banach space, there is a continuous linear mapping 
T: 9 — § such that ΤΣ — Τὰ for alla € MV. 

(iii) Suppose 5 is a Banach space, and 9M is a dense linear subspace 
of & such that 7,,x is convergent for each x € 91. Then, there exists 
one and only one continuous linear mapping 7’: ὃ — δ᾽ such that 
T, — T strongly in the sense of Exercise 2. This result is known as 
the ‘‘Banach-Steinhaus theorem.” 


*4. Let ὦ be a Banach space, & a normed space, and 5 a set of 
continuous linear mappings 7’: ὦ — δ; in other words, 3 C £,(@,8). 
Suppose ὅ is “pointwise bounded,” in the sense that for each r Ε ὦ 
there exists a constant M, such that || Τὰ || < MM, for all T € 5. Then, 
it can be shown that there exists a constant M such that || T || < M 
for all 7’ © 5. This result is known as the “principle of uniform 
boundedness.” 

#5. If ὦ isa Banach space, ὃ is a normed space, 7',,7' ΕΞ £,(@,8), 
and Τ᾽, — Τ᾽ strongly, in the sense of Hzercise 2, there exists a con- 
stant M such that || 7, || < Μ for all n = 1,2,3,---. 
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*6. Let ὃ and § be Banach spaces, 9t a dense linear subspace of δ, 
and 7',;: ὃ — §¥ a sequence of continuous linear mappings. Suppose 
Τα is convergent, for each x € 9. In order that Τὰ converge strongly, 
in the sense of Hzercise 2, to a continuous linear mapping 7: ὃ — 5, 
it is necessary and sufficient that there exist a constant M such that 
| Τα || < Mf for all πὶ = 1,2,3,---. 


§9. THE NORMED ALGEBRA £,(&), BANACH ALGEBRAS 


Definition 1. If & is a normed space, £,(&) denotes the set of all continu- 
ous linear mappings T:& — ὃ, 


Thus, £,(8) is the normed space £,(&,&) described in Theorem 8.1. 
In addition, £,(&) contains the composite 57' of any two of its 
members: 


Theorem 1. Suppose T: & — §¥ and S: 5. — G are continuous linear 
mappings, where δ, 5, and G are normed spaces. Then ST: ὃ — G ts 
a continuous linear mapping, and || ST || < || § || || T |]. 


Proof. 

ST is linear by Theorem 4.2. For any x € &, || (ST)z || = || S(Tz) |] 
< || S|) | Tx || < || S|] || Z|] || x |]; this shows that ST is continuous, 
and || ST || < || S|! || J || by Theorem 7.4. See also Exercise 6.2. ἢ 


One already knows that £(&) is an associative algebra (Zxample 
5.1). Since &,(&) contains the composite ST of any two of its elements, 
and since the identities of Theorem 6.1 hold in particular for the 
elements of £,(&), it is clear that £,(&) is itself an associative algebra. 
Moreover, it is a normed space, and || ST’ || < || 8 || || Τ' || for every 
pair of elements. This is an example of the following: 


Definition 2. A normed algebra 18 an associative algebra ἃ which ts 
also a normed space, such that || ab || < || a || || b || for all a,b ΕΞ ἃ. If 
moreover @ is complete (hence is a Banach space), ἃ is called a Banach 
algebra. 


In the above definition, it is assumed that a + b and da have the 
Same meaning for the algebra structure and the normed space struc- 
ture, Summarizing, 


Theorem 2. If & is a normed space, &,(&) is a normed algebra. If ὦ is a 
Banach space, £-(@) is a Banach algebra. 
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Proof. 
If @ is a Banach space, £,(@) is complete by Theorem 8.2. ἢ 
In particular, 


Theorem 3. {ἢ ts a Hilbert space, the set £.(5C) of all continuous 
linear mappings Τ σὺ — σ is a Banach algebra, relative to the defini- 
(S+ Tix = Sa+ Tx 


AT)x = (Tx) 
(ST)z = S(Tx) 
| 7 || = LUB {|| Tx || : || z « 1}. 


Exercises 
1. If δ, are normed spaces, and ὃ is finite-dimensional, then every 
linear mapping Τ: 6 — § is continuous. 


2. Let 3 be a non-empty set, @ the Banach space of all bounded 
scalar-valued functions x defined on 3, with 


| © ἴω = LUB {|χ(Ὁ}: ὁ Ε 35} 


(see Exercise 7.8). With products zy defined by (xy)(é) = σ( (ὃ, ἃ is ; 


a commutative Banach algebra. 
3. Let @ be the Banach space of continuous scalar-valued functions 
x defined on the closed interval [a,b], with 
Il z lo = LUB {{π(Ὁ}: α Φ ε <b} 
(see Exercise 7.9). With products defined by (xy) (t) = σ(ἢ (ἢ, @isa 
commutative Banach algebra. 


*4, If & is a normed space such that £,(8) is a Banach algebra, 8 
is necessarily a Banach space. — 


ὃ 10. THE DUAL SPACE ε΄ 


Definition 1. A linear form on the vector space Ὃ is a linear mapping 
f: Ὃ — @; that is, f ΕΞ £(0,€@). 

Thus, a linear form on V is a sealar-valued function f defined on Ὁ, 
such that the relations f(x + y) = f(x) + f(y) and f(Ax) = f(x) hold 
identically. In the context of normed vector spaces, one may speak of 
continuous linear forms: 
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Definition 2. Jf & is a normed space, the normed space dual of & is the 
set &.(8,C) of all continuous linear forms 7: & — ©. Notation: 8' = 
£.(8,C). 

Theorem 1. If & 18 any normed space, &' is a Banach space relative to 
ihe definitions 

(f + οὐ) = f(x) + σῷ 
(Af) (x) = ΧΙ) 
Il f || = LUB {|f@)|: || @ |] < 1}. 


Proof. 
Since Ὁ is a Banach space, Theorem 8.2 applies. J 


Examples 


1. Each vector y in a pre-Hilbert space Φ determines a continuous 
linear form μ΄: Φ — Ὁ, namely y’(x) = («|y), and one has || ψ' || = 
|| y || (see Haxample 7.4). The abundance of continuous linear forms 
on @ is expressed in the following result: given any non-zero vector 
z€ @, there exists an f € Φ' such that || f || = 1 and f(z) = || z |]. 
For, setting f = || z 2’, one hasf() = || z |*z’@ = [1 2 ||"@l2= 
|| 2 ||. This illustrates the following: 

*2, If & is any normed space, and z is a non-zero vector of 8, it can 
be shown, using the “THahn-Banach theorem,” that there exists at 
least one continuous linear form f on ὃ such that || f || = 1 and f(z) = 
|| z ||. In particular, & ~ {6} implies &’ # {0}. In the sequel, this 
example will be referred to only in the exercises. 

Theorem 2. Let Φ be a pre-Hilbert space. For each y € Φ, define y’ € Φ' 
as in Hxample 1. Then, the mapping y — μ' of Φ into Φ' is 

(1) conjugate-linear: (y + 2)’ = ψ' + 2’ 

Ay)’ = A*y’, 
and 

(2) isometric: || y’ || = || y |). 

If this mapping is surjective, Φ is necessarily a Hilbert space. 
Proof. 

For allz € 6, (y + 2) (4) = (tly +2) = (ly) + (2) = γι + 
2'(x) = (ψ' + 2’)(@), and (Ay)'(z) = (αἰλῳ) = A*(ely) = A*y'(z) = 
(A*y’) (2). 


— 


Ἷ 
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It follows that (y — z)’ = y’ — 2’; as shown in Lzample 1, 
ll (Ψ — 2)’ || = lly —|l, thus || y’ — 2’ || = ||y — 2 ||, so that the 
mapping y — μ΄ is isometric in the sense of Definition 7.3. 

Suppose the mapping ἢ — y’ is surjective. Given a sequence 
Yn ἘΞ Φ such that || ym — yn || > 0; since || ym’ — yn’ || = || Ym — Yn |] 
— 0, yn’ is a Cauchy sequence in Φ', By Theorem 1, there exists 
an f € Φ' such that || yn’ — f || — 0. By assumption, f = ψ' for suit- 
able y € Φ, thus || yn —y || = || yn’ — ν΄ || = ll yn’ —f || 7 0. In 
other words, yn — y, and it has been shown that every Cauchy 
sequence in Φ has a limit in Φ. See also Exercise 7.3. J 


Conversely, it will be shown in Theorem V.1.1 that if σῷ is a Hilbert 
space, the mapping y — μ΄ described in Theorem 2 is a surjective map- 
ping of 50 onto JC’. 

Suppose ὃ is a normed space, &’ its dual space. Since &’ is itself a 
normed space, one can in turn form its dual space (8’)’. This is called 
the bidual of ὃ, and is denoted 8”; it is, of course, a Banach space 
(Theorem 1). There exists a natural mapping of & into 8”, defined as 
follows. Given any x € ὃ, define a mapping x”: &’ —» Θ by the for- 
mula α΄ (7) = f(x) (f € &’). Then, zx” is a continuous linear form on 
&’, as is shown by the calculations «’(f + g) = (f + g)(@) =f(@@) + 
g(x) = 2"(f) +2'(g), 2”) = ΟΡ = M(@) = λα" Ὁ), and | 2””()| 
= |f(@z)| < || F || | x ||. Thus, 2” € δ΄, and || x” || < || z|| (see Ba 
ercise δ). Moreover, the mapping  — 2” is a linear mapping of ὃ 
into &”, that is, ( + y)” = 2” + y” and (Az)” = dz”; for, 
( Ἐπ)" Οὐ) =flet+y) = 71(5) +f) = 2") Ἐν") = +99) 9), 
and (λα) (7) = fx) = Af(z) = λυ (ἡ) = (Av")(f). In the next 
chapter we shall be concerned with the following special case: 


Theorem 3. /f @ is a pre-Hilbert space, the natural mapping x — x” 
of Φ into ©”, described above, is 


(1) linear: (a ote y ” = gf ne yf!" 


(Aa iy — λα", 
and 


(2) isometric: || x’ || = || z |. 
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Proof. 
(1) is proved in the preliminary remarks, and (3) is immediate from 
the definitions of x” and y’. 


(2): || 2” || < || 2 |] was established in the preliminary remarks. 
Moreover, || 2 ||? = (z|z) = 2(z’) < || 1} || χ' |] = lz" Il || 2}, 
hence || x || < || x” || (even if z = θ). 


Exercises 


1. If 9 is a dense linear subspace of the normed space δ, and g is a 
continuous linear form on 91, there exists one and only one continuous 
linear form g on & such that ἢ (xz) = g(x) for all € 51. The mapping 
g — ἢ is an isometric vector space isomorphism of 9t’ onto δ΄. 


2. If & is a normed space of finite dimension n, then every linear 
form on ὃ is continuous, thus &’ = £(&,@). The dimension of &’ is 
also ἢ. 


3. If His a Hilbert space of finite dimension n, the mapping y — ψ' 
described in Theorem 2 is surjective; that is, given any f € Jt’, there 
exists a (unique) vector y ΕΞ 50 such that y’ = f. This result is gen- 
eralized to arbitrary Hilbert spaces in the next chapter (see Theorem 
V.1.1). 


*4. If αὶ and y are distinct vectors of a normed spaced ὃ, there 
exists an 7 ΕΞ &’ such that f(z) τέ f(y). 


*5. If & is any normed space, the mapping « — 2” of ὃ into 8”, 
described above, is isometric: || x’ || = || ||. 


6. A normed space ὃ, for which the mapping x — 2” of & into &” 
is surjective, is said to be reflexive. It will be shown in the next chapter 
that every Hilbert space is reflexive (see Exercise V.2.2). Prove: 

(i) Every finite-dimensional Hilbert space is reflexive. 

(ii) Every finite-dimensional normed space is reflexive. 

*(iii) Every reflexive normed space is a Banach space. 


Leena 


> 


7. Let 8 and § be normed spaces, 7’: ὃ — § a continuous linear 
mapping. Prove: 

(i) For each g € δ΄, the mapping z — g(Tz) is a continuous linear 
form on ὃ, hence defines an element of &’. 


= 


Moreover, 


(3) 2"(y’) = ν΄) = @ly), for all xy € 6. 


-- -“ eS 


ΠΡ ——— —_ ee 


----.- 
--- ———. 


ae ὦ, ᾿- - ἜΝ 
ee 
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(ii) For each g Εἰ δ', denote by T’g the continuous linear form 
described in part (i). Then, 7’: δ΄ — δ΄ is a continuous linear map- 
ping, and || 7’ || < || T }}. 

(iii) If also S € £,(&,5) and ἃ is scalar, (S + T)’ = S’+ Τ' and 
(AT)’ = XT’. Thus, Τ' — Τ' is a (continuous) linear mapping of 
£.(&,F) into &,(',8’). 

(iv) By (ii), one can in turn form (T")': &” — §”. Write T” = (7, 
Then, Tx" = (Tx)" for all z € δ. 

*v) |’ = TI. 


8. If &5,¢ are normed spaces, and T: ὃ — 5, S: 5. — G are con- 
tinuous linear mappings, then (ST)’ = T’S’ (see Exercise 7 for the 
notation). 

9. Let & be a normed space, and t,,2 ΕΞ δ. One says that “xz, — αὶ 
weakly” in case f(zn) — f(x) for each f ΕΞ &’. Prove: if x, — 2, then 
Yn — x weakly. (However, weak convergence does not imply con- 
vergence; see Exercise V.1.7,] 


10. Let ὃ and ¥ be normed spaces, Τὶ & — § a continuous linear 
mapping. Prove: if χω € δ, and z, — x weakly, then Tz, — Tz 
weakly [See Exercise 9 for the terminology.] 


11. Let 8,5. be normed spaces, and T,,7' ΕΞ £,(&,F). One says that 
“ἡ. —» T weakly” in case Τρ — Tx weakly, for each x € ὃ, in the 
sense of xercise 9. Consider the statements 

(i) T, — T uniformly (see Exercise 8.2) 

(ii) JT, — T strongly 

(iii) T, — T weakly. 

Then: (i) implies (ii), and (ii) implies (iii). 

Observe that in the normed space £,(&,5), “Τὶ, — T weakly” has 
another possible meaning, namely f(T,) — f(T) for each f€ 
[£.(&,F)|’ (see Exercise 9). The distinction between this, and the 
concept in (iii), has to be determined from the context. 


*12. If 8 and § are normed spaces, ὃ τέ {@}, and £,(&,5) is complete, 
then ¥ is necessarily complete. 


Continuous Linear Forms V 
in Hilbert Space 


§ 1. Riesz-Frechet theorem 

§ 2. Completion 

§ 3. Bilinear mappings 

§ 4. Bounded bilinear mappings 

§ 5. Sesquilinear mappings 

§ 6. Bounded sesquilinear mappings 

§ 7. Bounded sesquilinear forms in Hilbert space 

§ 8. Adjoints 
sss 


§ 1. RIESZ-FRECHET THEOREM 


Chapters VI onward are concerned with the study of continuous 
linear mappings 7’: # — 5 of a Hilbert space into itself. This study 
is flavored, so to speak, by the circumstance that the dual space 5c’ 
can in a natural way be identified with 5¢ (see Theorem 2 below); it is 
this “self-duality” of Hilbert space which distinguishes the theory of 
continuous linear mappings in Hilbert space from the theory of such 
mappings in Banach space. 

lf ® is any pre-Hilbert space, each vector y € Φ determines a 
continuous linear form y’ on @, namely y'(x) = (x|y). This gives rise 
to a mapping y — y’, of @ into its dual space Φ', which is conjugate- 
linear and zsometric (see Theorem IV.10.2). lf this mapping is surjec- 
live, Φ is necessarily complete; we are concerned here with the con- 
verse, which is the heart of the whole chapter (see Theorem 7.2): 


Theorem 1. (Riesz-Frechet theorem) If f is a continuous linear form 
on a Hilbert space HX, there exisis one and only one vector y ΕΓ 20 such 
that f(w) = (a|y) for all x € ae. 
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Proof. 

That is, given f € 3’, the problem is to find a vector y € σὺ such 
that 7’ = f. 


Let σ᾽ be the null space of f; it is a closed linear subspace of ὅζ, by 
Theorem IV.7.2. Τῇ NM = 50, then f = 0, and one takes y = 6. Assuming 
320, 91 XK; since σι Ὁ ὁ = M by Theorem III.6.2, it follows that 
at σέ {6}. Letz © 91", z γέ θ. Since 91 and 91: have only the vector 
§ in common, necessarily f(z) # 0; replacing z by [f(z)]~*z, we may 
suppose f(z) = 1. 

Given any x € 50; one has x — f(z)z C MN, since f[z — f(x)z] = 
f(x) — f(a)f) = f(a) — f(a) = 0. Since z 1 N, 0 = (@ — f(z)z|z) = 
(x|2) — f(a)(z|2). Thus, f(x) = @|z)—(@|z) = (ely), where y = 
(z|z)—*z is independent of x. That is, f = ψ'. 

If also f = w’, (z|y) = (x|w) for all x; y = w results from Theorem 
11.2.1. } 


Combining Theorem 1 and Theorem IV .10.2, 


Theorem 2. Let Φ be a pre-Hilbert space, y — y' the mapping of Φ 
into Φ' described in Theorem IV .10.2. This mapping is surjective if and 
only if Φ is a Hilbert space. 


Exercises 


1. Let f be a linear form on the vector space U, f ¥ 0, and 9 the 
null space of 7. Show that there exists a vector y with the following 
property: every z © VU has a unique representation = Ay + z, where 
z € Mand ἃ is a suitable scalar. 


2. Let U,f,9 be as in Exercise 1. Suppose IM is a linear subspace of 
© such that (1) 9% τέ {8}, and (2) 91 and 2 have only the vector @ in 
common. Prove that 911 is one-dimensional, and every z ΕΞ VU has a 
unique representation « = y + 2 with y (Ε MW andz€ MR. 


3. If @ is a pre-Hilbert space such that 91: ὁ = 9 for every closed 
linear subspace 9 of Φ, then Φ is a Hilbert space. 


4, Let f be a linear form on the Hilbert space 5C, 91 the null space 
of f. If f is not continuous, show that 91 is a dense linear subspace of JC. 


5. If f is a linear form on a Hilbert space 3C, and 91 is the null space 
of f, then f is continuous if and only if ϑὺ is a closed linear subspace. 
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6. If ὃ1 is a complete linear subspace of a pre-Hilbert space @, then 
Φ = 9ῖ (Ὁ 91. (this is Theorem ITI.6.1). Another (admittedly circular) 
proof of this can be based on Theorem 1. 


7. Terminology as in Exercise IV.10.9. Prove: 
(i) Ina pre-Hilbert space ©, if z, — x weakly, then (z,|y) -- (aly) 
for each y Ε @. 
(ii) In a Hilbert space δῦ, 7, — 2 weakly, if and only if (t,ly) — 
(x| y) for each y € δ. 
(iii) If x, isan orthonormal sequencein a Hilbert space, then x, — θ 
weakly, but one does not have z, — θ. 


8. Let ὃ be a normed space, i a Hilbert space, and T,,7 © 
£.(8,%). Then, T’,, —» T weakly, in the sense of Exercise IV.10.11, if 
and only if (7,,2|y) — (Τὰν) for each pair x Ε 8, y € 8. 


9. If 3 is a Hilbert space, the mapping z — 2” of 20 into 20" is 
surjective, hence 3¢ is reflexive. 


§ 2. COMPLETION 


This section is presented as an application of the Riesz-Frechet 
theorem. It may be considered “optional” and highly omissible. 

One knows that the pre-Hilbert space of finitely non-zero sequences 
(Example 11.1.3) can be enlarged to a Hilbert space, namely 13 
(Lzample 11.5.1). In what sense does this hold for an arbitrary pre- 
Hilbert space @? In any case, there is a mapping ὦ -- «’’, of @ into its 
bidual Θ΄", which preserves sums, scalar multiples, and norms (see T’he- 
orem 17.10.8). Let us denote this mapping by U; thus, U: Φ > @”, 
Ux = x’. The linear subspace U(®) of ©” is a normed space which 
is in a sense “equal” to the normed space ®; more precisely, U defines 
an isometric vector space isomorphism of Φ with U(®). Since @” is a 
Banach space, the closure U(@) is also a Banach space (see Example 
1.7.8 and Exercise IV.7.1). Thus, after “identifying” Φ with U (6), 
we may think of the normed space Φ as having been enlarged to the 
Banach space U(@). There remains the problem of providing every 
pair of vectors in U(®) with a scalar product satisfying the axioms in 
Definition 11.1.1, in such a way that the norm deduced from the scalar 
product is the same as the given Banach space norm on U(@). This is 
ΤΣ Ὁ ἜΗΝ theorem, in which, incidentally, it is shown that 
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Theorem 1. (Completion) If Φ is any pre-Hilbert space, there exists a 
Hilbert space 3%, and a mapping U: Φ — δῦ, such that: 


(1) U ts linear. 
(2) (Ux|Uy) = (aly) forall τον Ε. 6. 
(3) U(@) is a dense linear subspace of K. 


In fact, the norm in the bidual ©” can be deduced from a suitable scalar 
product, and one can take KH = 0", Ux = x”. 


Proof. 

One knows that ®” is a Banach space (Theorem IV.10.1), and that 
the mapping U: Φ — ©” defined by Ux = 2” is linear and isometric 
(Theorem IV .10.3). Moreover, 2’(y’) = ν΄) = (x|y) for all z,y ΕΞ Ὁ, 
where y — Μ΄ is the isometric conjugate linear mapping of Φ into Φ', 
described in Theorem IV .10.2. ze: 

Let 20 be the closure of U(@) in ©”; that is, H = U(@). Kisa 
closed linear subspace of ®” (see the proof of Theorem ITT.3.2), and is 
a Banach space (Example IV.7.8). Later in the proof, it will be seen 
that σὺ = Φ'', 

We now have a linear isometric mapping U: Φ — &, where JC isa 
Banach space and U(@) is dense in δῦ. 

A scalar product is introduced in 20 as follows. Given u,v € SC, the 
problem is to define (u|v). Choose sequences Z, ΕΞ δι Yn © Ὁ, such 
that Uz, — uand Uy, — υ. Since || tm — zn || = || Um — χη) Il 
= || Utm — U2n || > 0, tn is a Cauchy sequence in Θ᾽; similarly for 
the sequence yn. It follows from Theorem II.5.1 that (%n|%n) is 8 
convergent sequence of scalars. We wish to define 


(*) (ulv) = lim (αν Yn), 


hence must check that this limit is independent of the particular 
sequences approximating u and v. Suppose also Z, C &, Gn C ®, with 
Uz, > u and Ug, — v. Then, || Z, — 2m || = || Vin — Utn|| 5 
\|UZ, — ὦ} + || το Usp || + 0, and similarly || Jn — yn || > 0. 
The desired equality lim (2n|yn) = lim (€,|9n) results from the iden- 
tity (En| Jn) = (Xn | Yn) = (ἢ, — tnlGn = Yn) Ἤ (n| Gn — Yn) + 
(ἃ, — Xn| Yn) (see the proof of Theorem 11.8.1). Thus, (w|v) is unambig- 
uously defined by the formula (ἢ). Since Uz, — τι, one has || ἔχῃ | 
— || w ||, hence (u|u) = lim (#|2n) = lim || 2n ||? = lim || Usp ||* = 
\| « ||; from this it is clear that 
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(i) (u|u) > 0, (μ} Ὁ = 0 if and only if u = θ. 
Moreover, 


(ii) (v|u)* = (α 
(iii) (u + υ] ) = (ulw) + (υ} ὦ) 
(iv) (ul) = A(ulr); 


the proofs are left to the reader. 

Thus, the scalar products (u|v) satisfy the axioms for a pre-Hilbert 
space, and the relation (u|u) = || u ||? shows that the given notion of 
| uw || (caleulated as in Theorem IV.10.1) coincides with the norm 
(ει u)* deduced from the scalar product (via Definition 11.3.1); these 
two (equal) norms will not be distinguished notationally. Since 20 was 
already a Banach space, the scalar product (u|v) makes 5C into a 
Hilbert space. 

One has (Uz| Uy) = (| y) for all 2,y € @; for, with the notation of 
formula (ἢ), one can take z, = x and y, = y for all n. Thus, the 
Hilbert space 50 and the mapping U: Φ — σὺ fulfill the desired con- 
ditions (1),(2),(8). 

It remains only to show that 3¢ = ©’. The proof will be broken up 
into a series of remarks: 


1, [fu € &, then (11 = u(z’) for all x € Φ. 

For, let ἄῃ € @ with z,"" — u. Since z,'"(z') = (z,|x), one has 
(u|x"’) = lim (5,1 = lim (z_|2) = lim z,/"(x') = u(x") (for the 
last equality, see Exercise IV.8.2). 


2. If f € δ’, there ts a unique vector u ΕΞ KH such that f(x) = (x"’|u) 
jor allz Ε @. 

For, let g be the linear form on U(@) defined by σία") = f(z). 
Since [g(x")| = [f(@)| < ||} lz |] = lf ll 2” ||, g is continuous. 
Since U(@) is dense in 3C, by Theorem IV.7.6 there is a continuous 
lear form h on 50 such that h(x’) = g(x”) for alla ΕΞ ©. By Theorem 
1.1, there exists a vector u € 50 such that h(v) = (v|u) for ally € 50. 
In particular, (2”|u) = h(a”) = g(x”) = f(z) for all x € ὁ. 

Suppose also τὸ € K satisfies f(x) = (τ [ἡ for all ῳ ΕΞ &. Then, 
(x |u — w) = f(x) — f(z) = 0 for all z€ Φ. Choose a sequence 
t, € Φ such that 2,” — ὦ — το; then, (u — wlu — Ὁ) = 
lim (2,/"|u — w) = lim 0 = 0, hence τὶ — w = 0. 


3. 9ῖ ΞΞ {ψ: y € P} ts a dense linear subspace of 6’. 
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9 is the range of the conjugate-linear mapping y — Μ΄ of @ into 0’; 
it is clearly a linear subspace of δ΄. Suppose f < @’, and e > O is given; 
the problem is to find a vector y ΕΞ Φ such that || f — y’ || < «€ (see 


Exercise III.2.1). By Remark 2, there is a vector τὸ € 50 such that | 


f(x) = (x"’|u) for all x € Φ. Choose any y € Φ such that || u — μ΄ || 
< «; this is possible because U(®) is dense in 5C. For any τε. Ὁ, 
I —y')2)| = Ὁ -- νὴ! = 1@"lo) -- Gly = [Ἰὼ -- 


ely)| =(@"lu-—y)1 sie" }}} ὦ -- ψ΄' l=lelle-w ss 


e || x ||, hence || f — y’ || < « by Theorem IV.7.4. Finally, 


4. 90 = @", 

Given F € Φ'', let us show that F ΕΞ 3c. For all y € @, one has 
IFy)| <IF ily || Ξ 1} iyi =F ily” 1. Defining GY”) = 
[F(y’)]*, G is a continuous linear form on U(@). Repeating the argu- 
ment in Remark 2, there is a vector uC σὺ such that G(y") = 
(y"’|u) for all y ΕΞ Φ, hence (uly) = [G(y’)]* = F(y’). Quoting Re- 
mark 1, u(y’) = F(y’) for all y < Φ. That is, u(f) = Ὁ) for all 
f € 91, where 91 is the dense linear subspace of Θ΄ described in Remark 
3:u = F results from Theorem IV.7.6 (see also Exercise IV 6.1). Thus, 
F=ucexn. J 


Exercises 


1. If x, is an orthonormal sequence in a pre-Hilbert space , then | 


tn — 0 weakly (see also Hercise 1.7). 


2. If & is a Hilbert space, the mapping x — 2x” of K into K” is 
surjective, hence * is reflexive. 


3. If δ isa normed space, call a linear subspace % quadratic in case 
there exists a scalar product (|) defined for x,y € 9, such that (1) 
9 is a pre-Hilbert space, and (2) (x| x)” coincides with the given norm 
|| z || for « € 91. Then: 

*(j) A linear subspace 9 of ὃ is quadratic if and only if || 2 +- y ||? + 
lz—y|P? Ξ 2} « 1" +2] ν 15 forall ay € 0. mu 

(ii) If 9t is a quadratic linear subspace of ὃ, its closure 1 is also 
quadratic. 


4, Notation as in the proof of Theorem 1; in particular X is the 
dense linear subspace of & described in Remark 3 of the proof. Prove: 

(i) 9 is quadratic in the sense of Ezercvse 3. 

(ii) The Banach space Φ' is a Hilbert space. 

(iii) Hence, in turn, ©” isa Hilbert space. Let K = @ be the Hilbert 
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space described in (ii). There is a mapping f > 7’ οἵ & onto K' = @” 
by Theorem 1.2. Show: if ὦ € @, then (2’)’ = x”. 


5. Let ὃ be a normed space. A completion for & is a pair (®,U), where 

(1) ® is a Banach space, 

(2) U: ὃ —> ὦ isa linear mapping, 

(3) U is isometric, 

(4) U(&) is dense in @. 

Then: 

*(i) It can be shown that every normed space ὃ has a completion. 
Specifically, the mapping Ux = x” of & into &” can be shown to be 
isometric, and one can take for @ the closure in &” of the linear sub- 
space U(&). 

(ii) Suppose (ὦ, 17) is a completion for 8. If ¥ is a Banach space, 
and S: ὃ — 5 is a continuous linear mapping, there is one and only 
one continuous linear mapping 7: ® — § such that TU = S. More- 
over, || Τ ἢ = || $ 

(iii) If (@;,U;) and (@2,U2) are two completions for δ, there exists 
a unique mapping T: ®; — Bz such that: (1) 7 is a vector space 
isomorphism, (2) 7’ is isometric, and (3) Uz = TU}. In this sense, a 
completion for & is unique. 


5 If Φ is a pre-Hilbert space, a completion for Φ is a pair (5C,U), 
where 
(1) 5C is a Hilbert space, 
(2) U: Φ -- δ isa linear mapping, 
(3) (Uzx| Uy) = (a|y) for all σὺν € @, 
(4) U(@) is dense in 50. 
Then: 
ὰ (i) According to Theorem 1, every pre-Hilbert space has a comple- 
ion. 
(ii) Discuss the analogs of parts (ii) and (iii) of Exercise 5, 
(iii) Regarding Φ as a normed space, any completion for ®, in the 
sense of Lzercise 5, leads to a completion of the pre-Hilbert space @. 
*7. Let & be a metric space. A completion for % is a pair (¥,U), 
where | 
(1) Y is a complete metric space, 
(2) ὕ: ὃ - y, 
(3) d(Uz,Uy) = d(z,y) for all z,y € 3. (i.e. U is isometric), 
(4) U(X) is dense in ¥y. 
It can be shown that every metric space 9X has a completion (J, U). 


116 Introduction to Hilbert Space ν $3 


8. Let Φ = I @M in the sense of Definition I11.2.6. Viewing Φ 
as a linear subspace of its completion 50, one has 50 = IN (Ὁ 91. 


§ 3. BILINEAR MAPPINGS 


If Xx and ¥Y are non-empty sets, the Cartesian product of X and Y is 
the set of all symbols (x,y), called ordered pairs, where x Ε X and 
y & Y. If (@1,y1) and (x2,ye) are ordered pairs, the relation (#1,y1) = 
(xo,2) means that both αὶ = τ and y; = yz. The Cartesian product 
of X and Y is denoted X X Y. If u = (x,y) Ε ἃ X Y, @ is the first 
coordinate of u, y is the second coordinate of τι. Observe that Ἢ X is 
conceptually distinct from 80 X YY. 


Examples 

1. If δὶ is the set of all real numbers, R X δὶ is the “Cartesian 
plane.” Any mapping ΤΊ δὶ Χ δὶ — @is called a “real-valued func- 
tion of two independent real variables’; for example, T(z,y) = 
«2 — υ". 

2. If Φ is any pre-Hilbert space, the scalar product in @ can be 
thought of as a mapping 7: ΦΧ @ — 6, namely T(z,y) = ( [ν}. 
Definition 1. If U,U,W are vector spaces, a mapping gy: U XV > W 
is said to be bilinear in case the relations 

φίαι + %2,y) = (%1,y) + o(X2,¥) 
p(Ax,y) ma Ag(x,y) 
o(x,yi + Y2) = φία νι) + φία,"}.) 
y(z,dy) = Ag(z,y) 
hold identically. If moreover W = @, ¢ is called a bilinear form on 
UX DV. 

Thus, ¢ is a bilinear mapping of U X Ὃ into ‘W, if and only if: (1) for 
each fixed y € Ὁ, « — ¢(z,y) is a linear mapping of U into W, and 
(2) for each fixed « € U, y — ¢(z,y) is a linear mapping of Ὃ into 
«Ὁ. Briefly, ¢ is linear in each coordinate. Clearly 9(z,@) = ¢(0,y) = 8. 


Examples 


3. Let Ube the vector space of n-ples z= (Ay,++*,An), Y= (μιν: " * yn) 
---,and define y(z,y) = >.) λεμε. Then, gisa bilinear form onU XU. 
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4. Let U and W be vector spaces, 7: U — W a linear mapping. 
Let Ὁ = £(°W,C) be the vector space of all linear forms on ἊΨ), and 
define φ: U XU — Ὁ by the formula ¢(2,f) = f(Tx). Then, gis a 
bilinear form on U X V. 


5. Let U,U,W be vector spaces, f a linear form on U, and 7:0 --ὁ W 
a linear mapping. Define ¢: U X Ὃ — W by the formula ¢(z,y) = 
f(x)(Ty). Then, ¢ is a bilinear mapping of U X Ὃ into W, 


6. If ἃ is an algebra, the mapping φ: ἃ X @ — @ defined by 
g(a,b) = ab is bilinear. In particular, if Ὃ is any vector space, the 
mapping (S,7') — ST of £(V) X £(V) into L(V) is bilinear (see 
Theorem IV .5.1). 


Exercises 


i If x and Y are non-empty sets, there exists a natural bijective 
mapping Τ:α XY —> YX &. 


= If Xx,Y,Z are non-empty sets, there exists a natural bijective 
mapping 7: ὃ X (Y X 2) — (X& XY) X Z. 


3. Any mapping ΤΊ ὃ. — ἃ determines a subset of X& X ‘YY, namely 
πὶ set of ordered pairs Gr = {(z,7'x): x € 50}, called the graph of 7’. 
rove: 
(i) 15: 85 — YandT:xX — y, thenS = Tif and only if $5 = Gp. 
(ii) Given 6 C & Χ Y. There exists a mapping ΤΊ ὃ — Y whose 
graph is 6, if and only if: (1) 9 ¥ ©, and (2) given (z,y,) Ε 9 and 
(1,1) e σ, necessarily Yi = Ya. 


4. If U,0,W are vector spaces, denote by @(U,U; W) the set of all 
bilinear mappings g: UX UV — W. For two such mappings φιψ, 
write ¢ = Ψ in case o(z,y) = ¥(x,y) for all z ΕΞ U and y € Ὁ. Then 
(U,V; W) is a vector space, relative to the operations 


(e+ ψ)ία,ν) = (ay) + ¥(z,y) 
(Ag) (x,y) = Ag(z,y). 

5. Notation as in Exercise 4. Suppose ¢ ΕΞ (αι, Ὁ; W). Eachz € αἱ 
determines a linear mapping gz: U — W, defined by ¢,(y) = ¢(z,y). 
Each y € Ὃ determines a linear mapping φῦ: U — W, defined by 
¢" (x) = ¢(x,y). Prove: 

(i) For each fixed g € ®(U,U; ‘W), x — ὡς is a linear mapping of 


U into £(0,W). 
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(ii) For each fixed ¢ € @(U,0; Ὁ), y -- φῇ isa linear mapping of 
Ὁ into £(U,W). | 

(iii) For each fixed eC U, φ — ¢z is a linear mapping of 
@(U,V; W) into £(V,W). 

(iv) For each fixed y€U, φ > ¢” is a linear mapping of 
Θίαι, Ὁ; W) into £(U,%W). 

(v) For each fixed pair (x,y) Ε U Χ Ὁ, ¢ — ¢(2,y) isa linear map- 
ping of Θίαι, Ὁ; W) into W. 

6. Notation as in Exercise 5. Suppose 9 © @(U,0; W). The map- 
ping z — ¢-; is a linear mapping of ἅϊ into L£(0,W); denote it by ¢, 
that is, (xz) = φ,.. Thus, ῷ ΕΞ £[U,£(0,W)]. Prove: g — é is a vector 
space isomorphism of ®(U,0; W) with £[U,£(0,W)]. 

7. Notation as in Exercise 5. Using the techniques of Exercise 6, 
obtain a vector space isomorphism ¢ — ᾧ of @(U,0; W) with 
£[V,£(U,W)]. 

8. Let U,W be vector spaces, UV = £(U,C) the vector space of 
linear forms f on ‘U. Prove: 

(i) For each fixed z € ‘W, (z,f) — f(a)z is a bilinear mapping of 
Ὧι X V into W. 

(ii) For each fixed f € Ὁ, (α,2) — f(x)z is a bilinear mapping of 
UX W into W. 

(iii) For each fixed x € U, (f,z) — f(a)z is a bilinear mapping of 
Ὃ X W into W. 

9. Given a fixed bilinear mapping ¢ of U X VU into W. For subsets 
$C U and 3 C Ὁ, define 


81 = {y € Ὁ: o(z,y) = θ for all z € 8) 


τ = {x € U: g(x,y) = θ for all y € 5}. 

Prove: 

(i) $+ and “5 are linear subspaces of U and UL, respectively. 

Gy sc ~(s*); ends cS C5). 

(iii) If 81 C So, then $,+ D 82; if σι C Je, then +5, D "Ὁ. 

(iv) (4(8+))* = 8+, and *((*3)*) = "Ὁ. 

10. Let U and ‘W be vector spaces. Prove: 

(i) U X W becomes a vector space, on defining (1,41) + (v2,y2) = 

(αι + 22,y1 + ye) and (x,y) = (Az,Ay), with zero element (6,6), and 


—(z,y) = (— 2, —y). Ὃ Χ W is called the vector space direct product — 


of Ὁ and ‘W. 
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(ii) A mapping 7: 0 — ‘W is linear if and only if its graph Gr is a 
linear subspace of U X W (see Exercise 3). 

(iii) ἃ — (2,0) is an injective linear mapping of U into U Χ ‘W; 
denote its range by U,. Similarly y — (6,y) is an injective linear map- 
ping of W into U X ‘W; denote its range by W,. Then: U, and W, are 
linear subspaces of Ὃ X W, and every vector in U X ‘W has a unique 
representation in the form v, + w,, with v, ΕΞ VU, and w, ΕἾ “Ὁ. 

(iv) Suppose SW and ® are linear subspaces of a vector space 4, 
such that (1) 9% + 9 = U, and (2) the only vector common to 9It 
and Nis 0. Then, there is a natural vector space isomorphism of U with 
the direct product vector space It X 2. 


11. Let ®,Q be pre-Hilbert spaces, and let ® X © be their vector 
space direct product, as defined in Exercise 10. Prove: 

(i) ® X Q becomes a pre-Hilbert space, on defining ((21,y1) | (72,42) 
= (119) + (y1|y2). P X Qis called the pre-Hilbert space direct prod- 
uct of Φ and 8, 

(ii) One has || (2,y) ||? = || x ||? + |[ y |/?. 

(111) Let ®, be the range of the isometric linear mapping z — (5,0) 
of Φ into ® XQ, and Q, the range of y — (6,y). Then @, = Q,+, 
Q = 0,*, and © XO = 6, (Ὁ Q, in the sense of Definition III.2.5. 

(iv) One has (χη) η) — (z,y) if and only if both z, — x and 
Yn —> Y. 

(v) ® X Qis a Hilbert space if and only if both Φ and ὃ are Hilbert 
spaces. 


12. Let X& and Y be metric spaces. Prove: 

(i) X& K Y becomes a metric space, on defining d((2x,4;),(x2,ye)) = 
d(x1,2%2) + d(y1,y2). X X Y is called a metric space direct product of 9 
and Y (there are other natural ways of defining distances in & X 4). 

(ii) One has (a,¥n) — (z,y) if and only if both z, — a and 
Yn —> ¥. 

(iii) If Τ᾽: 80 — Y is a continuous mapping, its graph Gr is a closed 
subset of ὃ & Y. 

(iv) For each fixed y, € Y,  — (x,yo) is an isometric mapping of 
x onto a closed subset ὃ, of X X Y. Similarly for the mapping 
Ψ — (x,y), where x) € &X is fixed. 

(v) ὃ X Y is complete if and only if both 9X and Y are complete. 

(vi) Given completions for X and Y, in the sense of Ezercise 2.7, 
there is a natural way of obtaining a completion for X X y. 
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13. Let & and δ᾽ be normed spaces, and let ὃ X δ᾽ be their vector 
space direct product, as defined in Fzercise 10. Prove: 

(i) & Χ ¥ becomes a normed space, on defining || (x,y) || = || x || + 
yi 

(ii) « — (x,6) and y — (6,y) are isometric linear mappings. 

(iii) If distances in 8,5, and ὃ X § are defined as in § 7 of Chapter 
IV, & X 5 is the metric space direct product of ὃ and δ, in the sense 
of Exercise 12. 

(iv) If 7: & — Φ is a continuous linear mapping, its graph Gr is 
a closed linear subspace of & X 5. 

(v) & X Fis a Banach space if and only if both & and § are Banach 
spaces. 

(vi) If 8 and δ happen to be pre-Hilbert spaces, (n,n) — (x,y) 


in the above sense if and only if (2n,yn) — (,y) in the sense οὗ 


Exercise 11. 


14. If X and Y are metric spaces, 7:X — Y is a bijective mapping, 
S = Τ᾽, and Gr is a closed subset of ἃ X Y (see Exercise 12), then 
Gs is also closed. 

15. (i) If δ isa Banach space, ¥ is a normed space, and 7’: ὃ - δ 
is a continuous linear mapping, then the normed space Gr (see Exercise 
13) is a Banach space. 

*(ii) If ὃ and ¥ are Banach spaces, and 7: ὃ — § is a continuous 
vector space isomorphism, it can be shown that Τ᾽ is also continuous. 

*(iii) If ὃ and ¥ are Banach spaces, and 7’: ὃ — δ is a linear map- 
ping such that Gr is closed, it can be shown that T is continuous. 


ἢ 4, BOUNDED BILINEAR MAPPINGS 


Definition 1. Jf &,5,G are normed spaces, a bilinear mapping 
φιᾶ ΧΟ — G is said to be bounded in case there exists a constant M 
such that || o(x,y) || < M || «|| || y || for alla ΕΞ ὃ and y € δ. If more- 
over Ὁ = ©, ¢ is called a bounded bilinear form on ὃ X σ. 


Examples 
1. Let ὃ and G be normed spaces, 7’: ὃ — G a continuous linear 


mapping, and δ᾽ = G’ the dual space of G. Define φ ὃ X F — @ by © 


the formula ¢(z,f) = f(T x). Then, ¢ is a bilinear form on ὃ X § (see 
also Ezample 3.4), and |g(x,f)| = [f(Tz)| < \lf ||| Τὰ Is 
| 7 || (Πα || || f 11) shows that ¢ is bounded. 
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2. If @ is any normed algebra, then the bilinear mapping 
yg: @ X @—@ defined by y(a,b) = ab is bounded: || g(a,b) || < || a || ||} |]. 
In particular, if € is a normed space, (8, ΤΊ — ST defines a bounded 
bilinear mapping of £,(&) X £.(&) into £.(&) (see Theorem IV.9.1). ἢ 


Using arguments similar to those in Theorems IV.7.3 and IV.7.4, 
the condition for boundedness of a bilinear mapping can be refor- 
mulated as follows: 


Theorem 1. [f &,5,G are normed spaces, and ¢: & X F§ — 9 is a bi- 
linear mapping, the following conditions on ¢ are equivalent: 

(a) is bounded. 

(Ὁ) My, = LUB {|| o(@,y) ||: || |] < 1, ll y |] < 1} as finite. 

(c) Ἧς = LUB {|| o(@,y) ||: || el] < 1, || yl] < 1} és finite. 

(dq) Mz = LUB {|| ¢(@,y) ||: || x || = 1, | yl] = 1} ts finite. 
If ¢ is bounded, My = Mz = M3; denoting the common value by || ¢ ||, 
one has 

(1) lle@y I <llelliziilyll for Wd ces, yes. 
Moreover, || ¢ || is minimal in this property: 


(2) If M2 0 and || φ(α, μὴ) || < M||z|| ||y|| for all eC & and 
y Ε 5, necessarily || || < M. 


Definition 2. Notation as in Theorem 1. The number || ¢ || is called the 
norm of the bounded bilinear mapping ¢: ὃ X δ — G. Thus, 


| ¢ || = LUB {|| ¢(@,y) ||: |] = |] < 1, ll yl] < 1}. 


Clearly || ¢ || = 0 if and only if g(x,y) = θ for all x € 8, y Ε 95; 
otherwise, || ¢ || > 0. In condition (d) of Theorem 1, one assumes 
ὃ γέ {0}, 5. γέ {0}, so as to have a supply of unit vectors. 


Exercises 


1. If &,%,G are normed spaces, and ¢: ὃ X § —> G is a bilinear 
mapping, the following conditions on ¢ are equivalent: 

(a) ¢ is bounded. 

(Ὁ) Ifz, — wand yn — y, then o(%n,yn) > φία,"). 

(c) Ifa, — @and y, —> θ, then 9(%n,yn) > 0. 


2. There is a normed space analog for Exercise 3.8. 
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8 There is a normed space analog for Exercise 3.9: assuming ¢ 
bounded, $+ and ἐδ are closed linear subspaces. 


4. If &,5,G are normed spaces, denote by Θ,ί(δ,5; G) the set of all 
bounded bilinear mappings ¢: ὃ X ¥ - §. Then, @,(8,5; 6) 1s a linear 
subspace of @(8,F; 9) (see Exercise 3.4), and becomes a normed space 
when || ¢ || is defined as in Definition 2. 


5. Let &5,¢ be normed spaces, ¢: ὃ XF — Ga bounded bilinear 
mapping. Show: ἐν as 

(i) For each fixed x € & y > ¢2(y) = ¢(2,y) is a continuous linear 
mapping of δ into 9, and || gz || < || ¢ || || x ||. Thus, ez € £-(5,9). 

(ii) For each fixed yO S, x > ¢¥(x) = g(x,y) is a continuous 
linear mapping of & into 8, and || φ᾿ || < || ¢|| | yl. Thus, φῇ € 
£-(8,8). * * 1" _s 

(iii) The mapping x — ¢(x) = ¢- 1s ἃ continuous linear mapping 
of 8 into £.(F,§), and || ¢|| < || φ ||. Thus, ᾧ Ε- £18, £-(5,9)]- 

(iv) The mapping y — O(y) = ¢ is a continuous linear mapping 
of 5 into £2(6,), and || @ || < || φ ||. Thus, Ὁ € LA5,L4(E,9)]. 

(v) In fact, || ᾧ || = |] ol = ll δ 1}. 


6. Notation as in Fzercise 5. Prove: ͵ Ἵ 

(i) The mapping φ — ᾧ is an isometric vector space isomorphism 
of Θείς, 5; Ο) with £,[6,£,(F,$)). ᾿ . 

(ii) The mapping ¢ — ᾧ is an isometric vector space isomorphism 
of &.(8,; 6) with £{F,£-(8,9)]. 


7. Let & and ¥ be normed spaces, ὦ a Banach space. If 0 is 8 
dense linear subspace of ὃ, 9t is a dense linear subspace of δ, and 
v: Mt X MW -- ὦ is a bounded bilinear mapping, there exists one and 
only one bounded bilinear mapping ¥: ὃ X δ᾽ - @such that v(x,y) = 
y(a,y) for all  € 91 and y € 9ῖ. 


*8 Let &5,G be normed spaces, and g: ὃ Χ § — G a bilinear map- 
ping. Suppose that for each x € ὃ," — ¢(@,y) 15 continuous, and that 
for each y € 5, x — g(2,y) is continuous. Assuming either ὃ or δ᾽ 1s 
a Banach space, it can be shown that ¢ is bounded. 


*9. Let & and G be normed spaces, ¥ = G’ the normed space dual 
of 9. Each T € 2&,(8,S) determines a bounded bilinear form 


:& X ¥ — @, namely ¢,(2,f) = f(T'x) ‘see Example 1), and one has. 


ll ep Il < IT I. 
'} hitetoraela l= IT tl. 
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(ii) 7 — gy, is an isometric linear mapping of £,(&,S) into 
Be(E,F; 6). 

(iii) If moreover G is reflexive, the mapping T — ¢p is surjective, 
hence is an isometric vector space isomorphism. 


10. The following is a specialization of Exercise 9 which can be 
worked out using material in the text. 

Let σὺ be a Hilbert space. Each 7 € £,(3C) determines a bounded 
bilinear form ¢,: Ὁ X 5’ — @, namely ¢,(2,f) = f(T'z). In view of 
Theorem 1.1, this means ¢,,(z,y’) = y'(T'x) = (Tzx|y) for all 2,y € 3. 
The relation || g,, || = || 7’ || is elementary (see Example IV.7.5). More- 
over, ὅ is reflexive, that is, the mapping 1 — 2” of 50 into 3C” is 
surjective (Zxercise 1.9). It follows that 7’ --Ὁ Yp 1s an isometric vector 
space isomorphism of £,(5C) with @,(5C,3C’; 6). 

11. If ὃ is a normed space, and 3¢ is a Hilbert space, there exists 
an isometric vector space isomorphism 7 — ¢, of £,(&,5¢) with 
®.(8,5C’; 6). 

12. If ὃ and δ᾽ are normed spaces, and G is a Banach space, then 


@-(&,F; 9) is a Banach space. In particular, @,(8,F; 6) is a Banach 
space. 


ἢ 5. SESQUILINEAR MAPPINGS 


Definition 1. If U,U,W are vector spaces, a mapping ¢: UX Ὃ > W 
ts said to be sesquilinear in case the relations 


φίαι + 22,y) = 9(21,y) + o(x2,y) 
e(Ax,y) = Ag(x,y) 
φία,νι + Yo) = 9(z,y1) + ¢(2,y2) 
φία,λψ) = A*o(z,y) 
hold identically. If moreover Ἃ = @, ¢ is called a sesquilinear form 
on UW Χχ Ὁ. 


Thus, ¢ is a sesquilinear mapping of αἰ X Ὃ into W, if and only if: 
(1) for each fixed y € VU, x — (x,y) is a linear mapping of 41 into W, 
and (2) for each fixed « € U, y — ¢(z,y) is a semilinear mapping of 
Ὃ into W (see Definition IV.1.8). Briefly, ¢ is linear in the first 
coordinate, semilinear in the second coordinate. Clearly ¢(0,y) = 
¢(x,0) = 0. 
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Examples 


1. If @ is any pre-Hilbert space, the mapping ¢: ® X P - © 
defined by o(x,y) = (x|y) is a sesquilinear form on ® X Ὁ. 


2. If U is a vector space, @ is a pre-Hilbert space, and 7: WU — Φ 
is a linear mapping, then (x,y) = (Tx|y) defines a sesquilinear form 
on U X Ὁ. 


3. If φι UX Ὁ —> Ὁ is a sesquilinear form, then the mapping 
v:0 XU -- 6 defined by ¥(y,z) = [o(z,y)]* is sesquilinear. J 


Obviously, one could paraphrase the results of the preceding two 
sections for sesquilinear mappings. However, there is a simple way of 
deducing the theory of sesquilinear mappings from the theory of 
bilinear mappings, via the concept of the complex-conjugate of a 
vector space: 


Definition 2. Lei U be a vector space, with sums denoted τ + y, and 
scalar multiples denoted λα. The complex-conjugate of Ὃ ἐβ the vector 
space Ὃ Ὁ obtained as follows: 


(1) The set Ὃ is the same as the set V. 


(2) [τῷ € V*, the sum of x and y is defined to be x + y (as cal- 
culated in Ὅ). 


(3) If isa scalar, and x ΕΞ U*, the multiple of x by ἃ 18 defined to 
be \*x (as calculated in VU); this will be denoled doz. 


It is easy to check that U* satisfies the axioms for a vector space 
listed in Definition 1.1.1. For example: (Au)oxr = (Au) ἢ = ("μ᾿ = 
*(u*xz) = A*(uox) = Ao(uox). It is clear that U* is the only vector 
space structure which can be defined on the set Ὃ in such a way that 
the identity mapping 1: Ὃ — U* is conjugate-linear. 


Examples 


4, If @ is any pre-Hilbert space, the mapping y — ψ' of @ into (@’)* 
is linear; for, (y +2)’ =y' +2’, and (Ay)’ = A*y’ = doy’ (see 
Theorem IV .10.2). 


5. If © and ‘W are vector spaces, a mapping 7’: Ὃ — ‘W is con- 
jugate-linear if and only if 7 : Ὅ" — W is linear (if and only if 
T:U — W* is linear). 
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6. If ὃ isa normed space, then &* is also a normed space with || z || 
defined the same as in & The point is that || doz || = || A*z || = 
[ἘΠ || x || = |A]- |] ||. The identity mapping 1: ὃ -- &* is then 
isometric and conjugate-linear; ὃ is a Banach space if and only if 8* is 
a Banach space. 


7. Combining Examples 4 and 6, y — y’ is an isometric linear 
mapping of @ into the normed space (6) ", 


8. If ® is a pre-Hilbert space with scalar products denoted (1 y), 
the vector space ®* becomes a pre-Hilbert space on defining scalar 
products [| y] by the formula [x/y] = (z|y)* = (y|x). For instance, 
[Aoa|y] = [A*x]y] = (y|A*x) = A(y|z) = Alz]y]. Thus, ®* acquires a 
norm from its scalar product, as well as the norm inherited from @ 
via Example 6; these norms coincide: [x|x]’* = (a|z)’*. @ is a Hilbert 
space if and only if ®@* is a Hilbert space. J 


The reason for introducing the space Ὃ is the following theorem, 
whose proof is immediate from Example 5: 


Theorem 1. Lei U,U,W be vector spaces, and ¢:U Χ Ὃ — W. Then, 
φ is a sesquilinear mapping of UX Ὃ into W if and only if it is a 
bilinear mapping of U X U* into W. 

The “polarization identity” (see Theorem 11.8.8) has a generaliza- 
tion for sesquilinear mappings: 


Theorem 2. If 0 and W are vector spaces, and ¢: 0 X Ὃ -- W is ses- 
quilinear, then 
g(x,y) = Hye@ - y,2 +9) — φία — y,2 — y) 

+ ἰφία + ty, + iy) — ἰφία — ty,x — ty)} 
for all x,y Ε Ὁ. 


Proof. 


One has φίῳ + y,« + y) = φία,2) + o(y,y) + o(z,y) + φίν,2). The 
proof proceeds as in Theorem 11.8.8, J 


An immediate consequence of Theorem 2 is the following: 


Theorem 3. If 9:0 X 0 — W andy: UX VD — W are sesquilinear 
mappings such that o(z,x) = ¥(x,x) for all x ΕΞ Ὃ, then φ = ψ, that is, 
o(x,y) = ¥(2x,y) for all zy & Ὁ. 
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Definition 3. Let Ὃ be a vector space, φ U Χ Ὃ — Θ a sesquilinear 
form. ¢ is said to be Hermitian in case ¢g(y,x) = [¢(z,y)]* for all 
zy €& ῦ. 


Theorem 4. A sesquilinear form φὶ Ὃ X Ὃ — © is Hermitian if and 
only if o(xz,x) ts real, for all x ΕΞ Ὅ. 


Proof. 
If ¢ is Hermitian, ¢(x,~) = [¢(z,x)]*, hence ¢(2,2) is real. 
Suppose, conversely, that ¢(2,x) is real for all «. Define ¥(2,y) = 
[e(y,x)|*; as noted in Example 3, ¥ is a sesquilinear form on Ὃ X Ὁ. 
By assumption, ¥(z,x) = ¢(2,2) for all z, hence Ψ = ¢ by Theorem 3. 
In other words, ¢ is Hermitian. ἢ 


Definition 4. Let U be a vector space, ¢: 0 X Ὃ — © a sesquilinear 
form. ¢ is said to be positive in case 9(2,x) > 0 for allx ΕΞ V. 


In particular, a positive sesquilinear form is Hermitian, by Theorem — 


4. There is a “‘Cauchy-Schwarz inequality” for positive sesquilinear 
forms: 


Theorem 5. If o is a positive sesquilinear form on Ὃ X Ὅ, then 
| g(x,y) |* < e(a,x) ¢(y,y) for all x,y € Ὁ. 


Proof. 

By analogy with scalar products, write || z || = V¢(z,z), only for 
convenience in the present proof. 

If o(z,2) > 0 or ¢(y,y) > 0, the proof proceeds as in Theorem 
11.8.4. 

Suppose ¢(z,z) = φίψ,}}) = 0. For all scalar A, one has 0 < 
φία + ryt + dry) = o(a,z) + A*o(z,y) + Av(y,2) + JAI? olyy) = 
A*o(x,y) +A(y,2). In particular, for A = —¢(x,y), 0 < —2| (x,y) 5. 
Thus, φία,}) = 0, and the desired inequality is obvious. 


Exercises 


1. Let @,Q be pre-Hilbert spaces, and suppose the pre-Hilbert 
spaces ®* and Q* are constructed as in Hxample 8. The following 
conditions are equivalent: 

(a) J: Φ — © is conjugate-linear, and (Jz|Jy) = (y|x) for all 
zy & 6. 

(Ὁ) J: ® — Q* is linear, and [Jz|Jy] = (x|y) for all z,y ΕΞ @. 

(c) J: @* — Q is linear, and (/z|Jy) = [z|y] for all zy € ΦῸ, 
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If ® = ὃ, 8 surjective mapping J satisfying condition (a) is ealled 
a conjugation of the pre-Hilbert space ®. For example, J(Ax) = (Ax*) 
is a conjugation in the Hilbert space 13, 

2. Let ὁ be a Hilbert space. Show: 

(i) The mapping U: σὺ — (3¢’)* defined by Uz = 2’ is a vector 
space isomorphism. 

(ii) The Banach space 3C’ becomes a Hilbert space, on defining 
(x’|y’) = (y|z). 

(iii) If 20’ is the Hilbert space described in (ii), (5¢’)* becomes a 
Hilbert space on setting [x’|y’] = (y’|2’). The mapping J: σὺ > 50, 
defined by Jz = 2’, satisfies condition (a) of Hzercise 1. 

(iv) With the Hilbert space structure of (5¢’)* defined as in (iii), 
the mapping U of part (i) is a Hilbert space isomorphism. 


3. If Ὃ is a vector space, the vector space (U*)* is entitled to be 
called equal to the vector space VU. Analogously for normed spaces 
and pre-Hilbert spaces (see Examples 6 and δ). 


4 If g: Ὃ X Ὁ — Wis bilinear or sesquilinear, the “parallelogram 
law” holds: 
g(t + ye + y) + φίω — νι, — y) = 29(z,z) + 2oly,y). 
5. If g: VU X Ὁ — W is bilinear, then 
oe + yz + y) — φία — yt — y) + 
ἰφία + ἵν + ty) — ἰφία — ty,x — iy) = θ. 


6. If U,0,W are vector spaces, the set $(U,U; W) of all sesquilinear 
mappings ¢: U X Ὃ — W isa vector space relative to the operations 


(e+ ¥)(z,y) = o(z,y) + ¥(z,y) 
(Ag) (x,y) = Ag(a,y). 


The identity mapping ¢ — ¢ is a vector space isomorphism of 
$(U,0; W) with B(U,0*; W). [See Exercise 3.4.] 


7. Let Ὃ be the vector space of 2-ples « = (λ1,λ2), y = (u1,mM2),°*°. 
Define o(z,y) = Ayue, ¥(z,y) = ou. Then, ¢ and y are bilinear forms, 
g(z,2) = ¥(z,x) for all z, but φ ¥ ψ. 


8. Let U,U be vector spaces. If yg: UX UV — @ is a sesquilinear 
form, define gf: U X U — @ by the formula ¢*(y,z) = [y(z,y)]*. 
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Then: 
(i) φῇ is ἃ sesquilinear form on Ὃ X ἅι. 

(ii) ¢ — φῇ is a bijective conjugate-linear mapping of $(U,U; 6) 
onto 8(0,U; 6). 

(iii) If Ὃ = U, φῇ = gif and only if ¢ is Hermitian. 

9. Let Φ and ὃ be pre-Hilbert spaces, J: ® — Q a mapping such that 
(Jx|Jy) = (y|x) for all z,y ΕΞ Φ. Prove: 

(i) If J(@) is a linear subspace of Q, then J is conjugate-linear. 

(ii) In particular, if @ = Q and J is surjective, then J is a conjuga- 
tion of ® in the sense of Lzxercise 1. 

(iii) If Φ = 50 is a Hilbert space, and ὦ (00) is a total subset of Ὁ, 
then J is surjective. 

10. Let ¢ be a positive sesquilinear form on U X VU, and let 1 = 
{fe Ε VU: o(2,2) = 0}. Then, 2 = {x ΕΞ Ὁ: g(x,y) = O forall y € V}, 
hence 9 is a linear subspace of Ὅ. 

11. If ¢ is a positive sesquilinear form, under what conditions on 
wand y does | (x,y) |” = 9(x,x)e(y,y) hold? 


§6. BOUNDED SESQUILINEAR MAPPINGS 


Definition 1. If &,G are normed spaces, a sesquilinear mapping 
g: & X § — G is said lo be bounded in case there exisis a constant M 
such that || 9(x,y) || < M || x || || y|| for αἷϊ « € ὃ, y € σ. If moreover 
G = δ, ¢ its called a bounded sesquilinear form. 


In other words, ¢ is a bounded sesquilinear mapping of ὃ X δ into 
G, if and only if it is a bounded bilinear mapping of ὃ X δ into G (see 
Definition 4.1, Example 5.6, and Theorem 5.1). The number 


LUB {|| e@y) |i: el Sly ll <u} 
is denoted |! ¢ ||, and is called the norm of φ. By Theorem 4.1, 
Theorem 1. Jf o:& Χ δ᾽ — Gis a bounded sesquilinear mapping, 
(1) || el] = LUB {|| e@y) |: |e] <1, yl « 1) 
= LUB {|| o(z,y) Il: | zl] = 1, ly ll = 1}; 
2) lle@yi<llelliziiyllforalecsé yes; 


(3) af M>0 and | g(x,y) || < M || x|| || y || for all eC 8 and 
y & &, necessarily || ¢ || < M. 


ν 6 Continuous Linear Forms 129 


Examples 


1. If @ is a pre-Hilbert space, the sesquilinear form φ ΦΧ Φ — 6 
defined by ¢(x,y) = (x! y) is bounded, by the Cauchy-Schwarz inequal- 
ity. Assuming © ~ {6}, || o |] = 1. 


2. If ὃ is a normed space, Φ is a pre-Hilbert space, and Τὶ ὃ — @ 
is a continuous linear mapping, then ¢(z,y) = (T'x|y) is a bounded 
sesquilinear form, with || ¢ || = || 7 || (see Hxample IV.7.6 or Exer- 
cise IV.7.4). 


3. If &F are normed spaces, and φὶ ὃ X ¥ — @ is a bounded 
sesquilinear form, the sesquilinear form Ψ:  Χ ὃ — @ defined by 
¥(y,x) = [y(z,y)]* is also bounded, and || ¥ || = || ¢||. ἢ 


The following result plays an important role in the exposition of 
spectral theory given in Chapter VIII: 


Theorem 2. If @ is a pre-Hilberi space, the norm of a Hermitian bounded 
sesquilinear form φὶ Ὁ X Φ — Ὁ is given by the formula 


ll ¢ || = LUB {| ¢(z,2)|: || x || < 1]. 
Proof. 

By assumption, ¢(2,x) is real, for all x (see Theorem 5.4). 

If || x || < 1, |e(a,)| < || ¢ || by the definition of || ¢ ||; hence the 
indicated LUB is a finite real number M, and M < || ||. Clearly 
|e(z,z)| < M||2z|| || z|| forallz ce. 

Fix x,y € Φ, with || x |] < 1, |! y |] < 1. It will suffice to show that 
ly(z,y)| < Μ. 


Case 1. Suppose y(z,y) is a real number. By Theorem 5.2, ¢(x,y) = 
Hola + yc + y) — φία — για — y) + ie(x + ἐμ, + ty) — ἰφᾷ -- 
ἔμ, — ty)}. Since ¢(z,z) is real for all z, and since ¢(z,y) is by assump- 
tion real, necessarily o(z,y) = Fi g(x + νιν + y) — φία — y,« — y)}. 
Quoting the parallelogram law at the appropriate step, | ¢(x,y)| < 
Flo@ + yz +y)| + lo@ —y2 —y|} < {mM || 2 + yl? + 


Mie --νἹ} == 2}. Ἐ2}ν 1} <2 12+2} = Μ. 


Case 2. In general, one can write | ¢(x,y)| = λφία,"), where ἃ is 8 
suitable complex number of absolute value 1. Then, ¢(Az,y) = 
Ag(z,y) = | e(a,y)| is real; since || Ax || = |A|- | zl] = || z]] <1, 
| e(x,y)| < M by case 1, thus | ¢(z,y)| <M. | 
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Exercises 


1. If 8,5.9 are normed spaces, the set 8.(&,; §) of all bounded 
sesquilinear mappings ¢: ὃ X 5. — G isa linear subspace of S(6,5; 6), 
and becomes a normed space on defining || ¢ || as above. The identity 
mapping ¢ — ¢ is an isometric vector space isomorphism of &.(&,5; §) 
with @,(8,5*; 9). (See Exercises 5.6 and 4.4) 


2. Let & be a normed space, Φ a pre-Hilbert space. Each continuous 
linear mapping ΤΊ ὃ — Φ determines a bounded sesquilinear form 
oni & ΧΦ — C@,namely g, (x,y) = (Tx|y), and one has || g, || = || 7'|| 
(see Example 2). Then, T — gp is an isometric linear mapping of 
L£,(8,P) into 8ι(δ,Φ; ©) (see Exercise 1). 


3. If Pisa pre-Hilbert space, 6 isa normed space, and g:® X 0 > 6 
is a bounded bilinear mapping such that g(y,z) = ¢(z,y), then 
ll ¢ || = LUB {|| o@,«)|\: |x|] < 1}. 


4. Let @ be a pre-Hilbert space, and define ¢(x,y) = i(x|y). Then, 
y is a bounded sesquilinear form, || ¢ || = LUB {| ¢(a,z)|: || «|| < 1}, 
but » is not Hermitian. 

5. Let @® be a pre-Hilbert space, and suppose U: Φ — @ is a 
continuous linear mapping such that || U || = 1 and Uz = z for some 
20. Define φρίτ,") = (Uzly). Then, || gy, =||Ull=1= 
LUB {| ¢,(2,2)|: || x || < 1}, but ¢, need not be Hermitian. 


§ 7. BOUNDED SESQUILINEAR FORMS IN HILBERT SPACE 
As a special case of Example 6.2, 


Theorem 1. If 20 and % are Hilbert spaces, and ΤΊ 3 — ὅς is a con- 
tinuous linear mapping, the formula ¢,(2,y) = (Tx|y) defines a 
bounded sesquilinear form φη, on KH X K, such that || vp || = || T lI. 


The point of stating Theorem 1 is that its converse is true: 


Theorem 2. If 3¢ and % are Hilbert spaces, and φὶ δῦ X K > Cia 

bounded sesquilinear form, there exists one and only one continuous 

linear mapping ΤΊ 532 — K such that o(x,y) = (1 1 ν) for all τ € &, 

γῈ We Ee siete of τομὰς ἡ σαν δὴ loll =lel= 
T |\. 
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Proof. 

Given x € σύ, the problem is to define Tz € K. Now, y — [¢(z,y)]* 
is a linear form fz on K, and |fz(y)| = |[o(z,y)]*| = |¢(z,y)| < 
ll ¢ {| || x |] || y |] shows that f, is continuous, and || fz || < || ¢ || || z {. 
By Theorem 1.1, there is a unique vector z ΕΞ K such thatf,(y) = (μ| 2) 
for all y € K. Defining Tx = z, one has (y| 71) = f(y) = [y(x,y)]*, 
thus ¢(z,y) = (Tx|y). Incidentally, || Tz || = || f.|| by Example 
IV.7.4, thus || Tx || < || ¢ || || x |]. It will now be shown that the map- 
ping 7’: σὺ — & is linear. 


T is additive: given any 11,1. € 50, T(x; + 2%) = Tx, + Tze. For, 
given any y © K, (T(t + 2)/y) = ot + 22,y) = φίαι,") + 
o(%2,y) = (7 αι }}) + (Txe|y) = (Tx, + Txe|y); see Theorem 11.3.1. 

T is homogeneous: If x € 3H, ἃ is scalar, and y € K, (T(Az)|y) = 
φίλα") = λφία,") = MTx|y) = (A(T 2)|y), hence T(x) = (Tz). 

T is unique: Suppose S: σὺ — οὗ is any mapping such that (Sz|y) = 
o(x,y) for all z € KH, y€ K. Then, (Tx — Sz|y) = 0 for all x € 50, 
y € K; hence Tx — Sx = θ for all z € &, that is, T =S. 


Since | Tx || < || ¢|| || {ϑ, the linear mapping T is continuous. 
Since ¢ = Pry ll ¢ || -- | T || by Theorem 1. is 


Exercises 
1. If % and & are Hilbert spaces, the correspondence T — ¢,, 
described in Theorem 1 is an isometric vector space isomorphism of 
£L-(5C,K) with $,(5C,K; 6). [See Hxercise 6.2.] 


2. If ὃ is a normed space, and & is a Hilbert space, generalize 
Theorem 2and Exercise 1 for bounded sesquilinear forms 9:8 Χ K — @. 


§ 8. ADJOINTS 
Throughout this section, 3C,K,£ denote Hilbert spaces, 
Theorem 1. Jf 7: σὺ — & is a continuous linear mapping, there exists 
one and only one continuous linear mapping T*: K — 5 such that 
(Tx|y) = (z|T*y) for allz € HK, y ΕΞ K. One has || T* || = || T |. 
Proof. 


Define ¥(y,x) = (y| Tx). Clearly y is a sesquilinear form on K X 20, 
and |¥(y,x)| = |(y|Tx)| = |(y| Tz)*| = |(Tz|y)| shows that y is 


132 Introduction to Hilbert Space ν §8 


bounded and |} ¥ || = || Τ' || (see Theorem 7.1). By Theorem 7.2, there 
exists a continuous linear mapping ΤῊ: K — 3¢, with || 7* || = || ¥ ||, 


such that ¥(y,xz) = (T*y|x) for all y C &, « € K. Thus, (y|Tz) = 
(T*y|z), (Tz|y) = ΕἸ ΤῊ). } 

If S: K — 5C is any mapping such that (Tx|y) = (x|Sy) for all 
2 €& 5, y & ®, then (αἰ ὅν — ΤῊ) = 0 for all x and y, hence Sy — 
T*y = 0 for all y; that is,S = ΤῸ J 


Definition 1. Notation as in Theorem 1; T* is called the adjoint of T. 


The rest of the section is devoted to proving properties of adjoints 
which will be used often in the sequel. 


Theorem 2. If S: 905 — K and ΤΊ KH -- K are continuous linear 
mappings, and d is scalar, 


(1) (SS+T7)* =S8*+T* 
(2) (AT)* = A*T* 
(3) (T*y|z) = (y|Tz) foralyC K, eC HK 
(4) (T*)*=T 
(5) | T*T || =| TT* = TIP 
(6) T*T = 0 ¢f and only if T = 0. 
Proof. 
(3): (ΤῊ) = (a| T*y)* = (Tx|y)* = (y| Tz). 


(i): (SS + T)*y|z) = (y|(S + Tx) = (Se + Tz) = (y|Sz) + 
(y| Tx) = (S*y|x) + (T*y|x) = (S*y + T*y|x) = ((S* + T*)y|z). 


(2): (AT) *y| x) = (y| AT)2) = (y|A(P2)) = A*(y| Tz) = A*(T*y| 2) 
= (A*(T'*y)|z) = (A*T*)y|2). 


(4): ((T*)*z|y) = (2|T*y) = (10). 


(5): Observe that the composites ΤῊΤ: σὺ — Sand TT*:K — σὲ 
are defined. If x€ 50, || a|| <1, one has || Tx ||? = (Tzx|Tz) = 
(T*Tx|z) < || T*Tx || || « [5 [|| T*T2 || < || T*T ||; taking LUB 
over || a || <1, || TI? < || T*7 ||. By Theorem IV.9.1, || T*T || Ss 
| 7* || | TI =| TI? Thus, || ΤῊΤ ||? = || T ||’. Replacing T by T*, 
|| (Ὁ "ΤῸ || = || T* ||?, thus || TT* || = || T II’. 


(6): follows at once from (5). ἢ 
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Theorem 3. Jf Τ': σ( — KandS: K — & are continuous linear map- 
pings, then (ST)* = T*S*. 


Proof. 

Ifz€ £ and z € K, ((ST)*z|z) = @|GST)xz) = @|S(Tz)) = 
(S*z| Tx) = (T*(S*z)|z) = ((T*S*)z|z). J 
Theorem 4, Let 7’: σὺ — δὲ be a continuous linear mapping. If § C K, 
6 K, and T(S) C5, then Το ἢ c 8+. 
Proof. 

Given z © & with z 5, the problem is to show that ΤΣ 1 8. If 
xz Ε 8, then Tx € 3, hence 0 = (z|Tx) = (T*z|z). J 

In particular, 
Theorem 5. Suppose T: 3 — δ is a continuous linear mapping, M is 


a closed linear subspace of 5C, and It 18 a closed linear subspace of K. 
Then, T(M) τ 2 tf and only if T*(+) C m+. 


ee c m+, then (ΤῸ ἐσ Ὁ) Cc οι. Ὁ by Theorem 4; quote 
Theorem III.6.2 and part (4) of Theorem 2. _ | 
Theorem 6. Jf T: 5C — X is a continuous linear mapping, 

(1) {a: Tx = 0} = [T*(K)]* 

(2) {a: Τὰ = 0}+ = T*(K) 

(3) {y: Τὸν = 0} = [T(3)]* 

(4) {y: T*y = 0}*+ = ΤΩ. 


Thus, the null space of T is the orthogonal complement of the closure of 
the range of ΤῸ, 
Proof. 

Let 911 = {2:7'x = 6}; 511 isa closed linear subspace of δῦ. Similarly, 
% = {y: ΤῊ = 6} is a closed linear subspace of K. 

Since Τ (910) € {6}, one has T*({e}+) Cc σι. by Theorem 4; thus, 

(i) T*(K) ἃ m+. 
Let δ, T(K); the relation T(5C) C T(3) can be written 
T(S) € 3, hence T*(5+) C 8+. Since $+ = 3¢*+ = {6}, one has 7*(5+) 
= {9}, 3+ CH, thus 

(ἢ) [Το] oN. 
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Applying these results to 7'’* in place of 7’, one has 

(Ὁ) T(5e) Co 

(15) [T*(K)]* CM. 
From (i), st ¢ σι Ὁ ὁ τ [7'*(K)]*; combining this with (Ὁ), 

(1) m = [T*(K)]*. 
(Incidentally, 9% = 9v+~ results formally from these calculations; see 
also Theorem III.6.2.] 


From (1), s+ = [7*(K)]**; quoting Corollary 1 of Theorem 
111.6.8, 


(2) m+ = T*(K). 
The relations (3) and (4) result on replacing T by Τ᾿ in the relations 
(1) and (2). ἢ 


1. Let Φ and Q be pre-Hilbert spaces, and suppose 7: Φ — Q and 
5:8 — Φ are mappings such that (T'x|y) = («| Sy) for all ᾧ ΕΞ Φ and 
y © 8. Then, S and T' are linear. 


2. If σὺ is a Hilbert space, (S,7) — ST™* is a bounded sesquilinear 
mapping of £,(3C) X £.(5) into £,(5C). So is (5,7) — TS. 


3. Let 50 be classical Hilbert space, 7: 3¢ — 50 a continuous linear 
mapping. Given an orthonormal basis e, for 5%. Let Te, = Di jC; 
be the expansion of 7'e; in terms of the given orthonormal basis (see 
Theorem 11.8.1). The array (a;,) is called the matrix of T relative to 
the given orthonormal basis (see also Exercise JV .1.6). Prove: 

(i) (Tex|e;) = age 

(ii) If ΤῊ has matrix (6;,) relative to the same orthonormal basis, 
then 8; = a,;*. So to speak, the matrix (8;;,) 15 the “conjugate trans- 
pose” of the matrix (a;,). 


4. Let 90 and & be Hilbert spaces, Τ᾽: σὺ — & a continuous linear 
mapping. Prove: 

(i) T’y’ = (T*y)’ for ally ΕΞ & (see Exercise IV .10.7 for the defini- 
tion of T’). 

(ii) If 0: 50 — SC’ is the conjugate linear isometric mapping Uz = 
xz’ deseribed in Theorem IV.10.2, and similarly V: K — &’ is the 
mapping Vy = y’, then 7* = U~'T’V. 
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(iii) Identifying 35¢ with (5¢’)* via the Hilbert space isomorphism 
« — x’ (see Exercise 5.2), and similarly identifying * with (x’)*, one 
has ΤῊ =  Τ'͵ 


5. Let i and & be Hilbert spaces. Form the direct product Hilbert 
spaces JC X K and K X Ie (see Hxrercise 3.11). Define W: 5 K αὶ - 
K X I by the formula W(z,y) = (y,—2z). Prove: 

(i) W is a Hilbert space isomorphism. 

(ii) If ΤΊ σὺ — K is a continuous linear mapping, the graphs of T 
and ΤῊ satisfy the following relation: Gre = [W(Gr)]* (see Exercises 
3.10 and 3.13). 

What is the formula for W*(y,x7)? 


*6. Let iC be a Hilbert space, Φ a pre-Hilbert space, and suppose 
Τ: — Φ and S: Φ — SC are mappings such that (Τα) = (x|Sy) 
for all x € σὺ and y € @. One knows that S and T are linear by 
Exercise 1. It can be shown that S and 7' are continuous. If moreover 
@ = Kis a Hilbert space, then S = 7*. 

7. Let @ be the pre-Hilbert space of finitely non-zero sequences, and 
let e¢, be the canonical basis for Φ (see Example 1.7.8). There is a 
unique linear mapping 7: Φ — © such that Te, = kez for all k. 
Prove: (Tx|y) = («| Ty) for all τ ΕΞ Φ, but T is not continuous. 
Compare this result with Fzercise 6. 

8. This exercise is preliminary to Exercise 9. A doubly indexed 
family of real numbers aj, > Ὁ (7 = 1,2,3,---; & = 1,2,3,---) is said 
to be summable in case there is a constant Ἡ > 0 such that 


Lim > Se air ἘΜ 


for all indices m and n (equivalently, the sum of any finite number of 

the aj, is <M). One then writes Dus p Mk for the LUB of all such 

finite sums. If aj, > 0 is a doubly indexed family, the symbol 

Du; . Gk < © signifies that the family is summable. 

" Show that for a family aj, > 0, the following conditions are equiv- 
ent: 


(a) Das αἷς < το. 


(Ὁ) For each 7, > aj, < ©, and pis (= ain) < 0, 


(c) For each k, pas Qj, < ©, and a (x, ain) < ὦ. 
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If either (hence all) of these conditions is fulfilled, one has 


¥, (¥en) = D,(Zyen) = Dynan 


9. This exercise is preliminary to Exercise 11. Denote by 5 the set 
of all doubly indexed families x = (Aj) of complex numbers such that 
Din! el? < @ in the sense of Exercise 8. If y = (ujx) is another such 
family, define x = y in case λρκ = 45x for all j,k. With operations 
suitably defined, σὺ is a Hilbert space. 


10. Let 320 and & be classical Hilbert spaces, e, an orthonormal 
basis for 50, and f, an orthonormal basis for K. If T: σὺ — Kisa 
continuous linear mapping, one has >, || Tex ||? < οὐ if and only if 
DX, ΠΤ ||? <=, and in this case one has 


dX, ll Lee ll? = ΣΣ, 1 ΤῊ} = ΣΟ, | (Pend) FP. 


11. Let 32 and & be classical Hilbert spaces, 6, a fixed orthonormal 
basis for 3C, f, a fixed orthonormal basis for K. A continuous linear 
mapping ΤΊ 0 — Kis said to be of Hilbert-Schmzdt class if there exists 
an orthonormal basis g, of 3¢ such that >», || Tx ||? < οὐ. Briefly, 
T is of HS-class. Prove: 

(i) If ΤΊ 90 -- & is of HS-class, then ee \| Tx ||? < © for every 
orthonormal basis gn of 3¢, and the value of this sum is independent 
of the particular orthonormal basis. 

(ii) If S:3 — Kand T: σὺ — *® are of HS-class, so are S + Τ' 
and X17’. The continuous linear mappings Τ᾽: 50 -- of HS-class form 
a linear subspace £j5(5C,K) of £.(5,K). 

(iii) A continuous linear mapping 7: δ — X is of HS-class if and 
only if ΤῊ: K — is of HS-class. 

(iv) If T: 50 — & is of HS-class, and &: σὺ — K, S:K — K are 
continuous linear mappings, then ΤΊΝ: σὺ -- K and ST: KH — K are 
of HS-class. 

*(v) If ΤΊ 50 -- & is of HS-class, define aj, = (Tex.|f;). The cor- 
respondence 7’ — (a,x) is a vector space isomorphism of Lps(5C,K) 
with the Hilbert space described in Exercise 9. . 

*(vi) If Sand T are of HS-class, define (51 ΤΊ = Σὺ, (Sex| Tex). 
Then, Lrs(5C,K) becomes a pre-Hilbert space, and the scalar product 
(S| ΤῊ is independent of the particular orthonormal basis employed 
in its definition. The mapping Τ' — (a;,) described in (v) is an 
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isometric vector space isomorphism; it follows that £;,(5C,H) is a 
Hilbert space, and the mapping T — (a;x) is a Hilbert space 
isomorphism. 


12. If @ is an algebra, a mapping a — a* of @ into @ is called an 
involution in case: (1) a** = a, (2) (a + δ)" = αἴ + b*, (3) (ab)* = 
b*a*, and (Aa)* = A*a*, for all a,b ΕΞ @ and scalar ἃ. An algebra with 
involution is also called a *-algebra. A normed algebra with involution 
is called a normed *-algebra. Prove: if @ is a normed *-algebra such that 
|| a*a || = || α ||? for all a € ἃ, then || a* || = || a |]. 


13. This exercise is preliminary to Exercise 14. If @ is an algebra, a 
subalgebra of @ is a linear subspace ὃ such that: if a ΕΞ 8 and b € 8, 
then ab € 8. Prove: 

(i) In a normed algebra, the relations a4, — a and ὃ, — ὃ imply 
Andn — ab. 

(ii) If 8. is a subalgebra of the normed algebra @, its closure § is a 
closed subalgebra of @. 

(iii) If @ is a Banach algebra, every closed subalgebra of @ is itself 
a Banach algebra. 


14. If ἃ is a *-algebra, a *-subalgebra of @ is a subalgebra $ such 
that a € 8 implies αἴ € 8. 
(i) Suppose @ is a normed *-algebra such that || a* || = || a || for all 
a € ἃ. Then, a, — a implies a,* — αὖ, 
(ii) If ἃ is as in part (i), and 8 is a *-subalgebra of @, its closure 8 
is also a *-subalgebra of @. 


15. A B*-algebra is a Banach *-algebra @ such that || a*a || = |! α ||? 
for all a € @. Prove: 
(i) If 8 isa *-subalgebra of a B*-algebra @, then 8 is a B*-algebra. 
(ii) If 7 is a Hilbert space, and 8 is a closed *-subalgebra of £,(5C), 
then 8 is a B*-algebra. 


*16. There exists a converse for part (ii) of Exercise 15, known as 
the “theorem of Gelfand-Neumark”: if @ is any B*-algebra, there 
exists a Hilbert space 5C, and a mapping ¢: @ — £&,(3), such that 

(1) φία - δ) = ofa) + 9(b) 

(2) φίλα) = Ag(a) 

(3) g(ab) = φ(α)φί() 

(4) || φί(α) || = lel] 

(5) ¢(a*) = [y(a)]* (= the adjoint of φ(α)). 
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It follows that 8 = ¢(@) is a B*-algebra of the kind described in part 
(ii) of Exercise 15; hence, @ is essentially “equal” to a B*-algebra of 
this kind. 

17. Notation as in Exercise 11, with σὺ = σὺ, Then, £,,(5C,5C) is a 
*-subalgebra of £,(5¢). 


18. An involution in a vector space Ὃ is a mapping + — σῇ of Ὃ into 
Ὃ such that: (1) af? = a, (2) (ὦ -Ἡ υ) = σῇ + y?, and (3) (λα) = 
*r?, Prove: 

(i) If U is a vector space, and φ: Ὁ X Ὃ — Ὁ is a sesquilinear 
form, define ¢? (x,y) = [¢(y,x)]*. Then, ¢ — φῇ is an involution in the 
vector space 8(U,U; 6) (see Hxercise 5.8). 

(ii) Let 5¢ be a Hilbert space. If φ, is defined as in Theorem 7.1, 
then (yp)? = gps. 


Cha pter 


Operators in Hilbert Space YI 


§ 1. Manifesto 

§ 2. Preliminaries 

§ 3. An example 

§ 4, Isometric operators 

§ 5. Unitary operators 

§ 6. Self-adjoint operators 

§ 7. Projection operators 

§ 8. Normal operators 

§ 9. Invariant and reducing subspaces 


§ 1. MANIFESTO 


For the rest of the book, we settle down to the study of various 
types of continuous linear mappings of a Hilbert space into itself. 
Generalities are henceforth banished to the exercises, and the reader 
is invited to assess each definition and proof for its possible level of 
generality. For example, some statements about Hilbert space will 
make sense for normed spaces, or vector spaces, and so on; observa- 
tions of this sort will be found in the exercises. Henceforth: 


1. 50 denotes a Hilbert space + {6}. 


2. Any continuous linear mapping T: σὺ — JC will be called an 
operator. In particular, J is the zdentity operator, 0 is the zero operator, 
XJ is a scalar operator. 


3. ΤῊ denotes the adjoint of the operator 7’. Thus, 
(i) Τ᾿ is an operator 
Gi)  (Tzx\y) = (e|T*y) for all zy ΕΞ KH 
(iii) T*=T 
(iv) (9 + T)* = 8. +7" 
139 
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Ὁ) 017)" -- λ"7" 
(vi) (ST)* = T*S* 
(vii) || T*|| =| T } 
(viii) || T*T || = || T II? 
(ix)  T*T = Oif and only if T = 0. 


These facts about adjoints are assumed to be known to the reader | 


(see ὃ 8 of Chapter V); they will be used in the sequel without explicit 
reference. 
The special types of operators 7 which will be discussed in this 

chapter are defined as follows: 

isometric operator: T*T = I 

unitary operator: ΤῊ =TT*=T] 

self-adjoint operator: T* = T | 

projection operator: TT = T and ΤῊ = T 

normal operator: ΤῊ = TT*. 


Other types of operators (positive, invertible, etc....) are diseussed 
in the exercises. 


§ 2. PRELIMINARIES 


The theorems of this section will be used often, and without explicit 
reference, in the sequel. 


Theorem 1. The null space I of an operator T is a closed linear sub- 
space of δ. 
Proof. 

See Theorem IV.7.2. ἢ 


Theorem 2. If 8 is a total subset of 3C, and S,T are operators such that 
Sx = Tx for all x € 8, then S = Τ. 


Proof. 

Let % be the null space of S — T. By assumption, $ C MN. Since $is 
total, so is 9, hence ὃ1 = {6} (see Lxample III.2.1). By Theorem 
11.6.2, σι = σ΄ = {o}+ = H; thatis, S-T=0. ἢ 


Theorem 3. If S and T are operators such that (Sx|x) = (ΤῈ for all 
z€ K,thenS = T. 
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Proof. 

Consider the sesquilinear forms ¢ and wy defined by the formulas 
y(x,y) = (Sxly), ¥(a,y) = (Tz|y) (see Example V.5.2). By assump- 
tion ¢(z,z) = ¥(z,x) for all z, hence ¢ = Ψ by Theorem V.5.3. Thus, 
(Sz|y) = (Tx|y) for all z and y; Sx = Tx results from Theorem 
11.3.1. J 


Exercises 


1. If X is a set, Φ is a pre-Hilbert space, and S,7' are mappings of 
x into Φ such that (Sz|y) = (Tx|y) for all x ΕΞ X and y € @, then 
S = Τ, 


2. If T is an operator such that Tx is orthogonal to x, for every 
vector z, then 7 = 0. 


3. If StS + T*T = 0, then 8 = T = 0. 


4. Let S and T' be operators, $ a total subset of 20. Show: 

(i) If (Sx|y) = (Tx! y) for all zy € 8, then S = Τὶ, 

(ii) It can happen that (81:1) = (Tx|x) for all ᾧ Ε 8, and yet 
S x T. 


5. Theorem 3 holds for (not necessarily continuous) linear mappings 
in pre-Hilbert space. 


6. In a normed space ὃ, call a subset 85 éofal in case: the only 
continuous linear form f on ὃ such that f(z) = Oforallz € 8, 15 = 0. 
Then: 

*(i) It can be shown that if 8. is a total subset of ὃ, then & is the 
smallest closed linear subspace of & which contains 8. Theorem 2 then 
generalizes to normed spaces. 

(ii) In Hilbert space, a subset $ is total in the above sense, if and 
only if it is total in the sense of Definition 11.8.1, 


7. Let S be a subset of a pre-Hilbert space Φ. 

(i) If 8. is total in the sense of Exercise 6, then it is total in the 
sense of Definition 11.8.1. 

(ii) The converse of (i) fails. 


83. AN EXAMPLE 


The theorem of this section is a useful source of examples of op- 
erators. Suppose % is classical Hilbert space, and 2, is an orthonormal 
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basis for 50. Thus, every x ΕΞ 3C has a unique representation z = 
>, Aer, Where Dor |Az|? <, namely Xz = (z|2x) (see Theorem 
11.8.1). 


Theorem 1. Let un be a bounded sequence of complex numbers, M = 
LUB {|px|: & = 1,2,3,---}. There exists one and only one operator T 
such that 


(1) Tay = wate for all k. 


Moreover, 
ὦ) ΤΣ: nm) = Dy Awe 


(3) ||T\_ Ξ 

(4) T*r, = ue*xe for all k 

(5) T* (x: dite) = DO, Mere 

(Ὁ ΤῊ a TT. 

Proof. 

(1),(2),(3): Given x € 5¢, say x = Σὺν λεῖα; the problem is to de- 
fine Tx. Since Dy ene? < M? Dy el? = M* || x ||? (see Theorem 
11.7.2}, one can define Tx = >, Mute (see Theorem 11.7.1). The 
linearity of 7 results from the Lemma to Theorem III.3.2. Since 
| Tx ||? < M? || x||?, T is continuous, and || T || < M. 

Clearly Τὰ = were; since || x || = 1, || T || > |] Tze |] = || were |] = 
[με] for all k, hence || 1. > M. 

(4),(5): Suppose « = S07 da, and T*x = Σὰν »4.2,; the problem is 
to show νὰ = Axuz*. For all k, vy, = (Τ᾿ 2%) = (| Taz) = (αἰ wpry) = 
wE*(x| awe) = we*rr. 

(6): Clearly T*T'x, = |ux|2x, = TT*x,; since the 2, are total, 
"7 = 27. 


Uniqueness: If S is any operator such that Sx, = up, for all k, then 
Sx; = Tx, for all k; since the τὰκ are total,S=T. J 
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Exercises 


1. Notation as in Theorem 1. For each n = 1,2,3,---, let Tn be the 
unique operator such that T,7% = mer, for k <n, and T',x, = θ for 
k > n (the existence of 7, is a special case of Theorem 1). Then, 
T,, — T strongly, in the sense of Lzercise IV .8.2. 


2. If & and § are normed spaces, S: ὃ — δ is a continuous linear 
mapping, x € ὃ, and a = Σὸν 2» in the sense of Definition 11.7.1 


(generalized to normed spaces), then Sa = sam Sx. 


ὃ 4. ISOMETRIC OPERATORS 


According to Definition IV.7.3, an operator T' is isometric in case 
| Tx — Ty || = || 2 — y || for all σὺν € 3; by Theorem IV.7.5, this is 
equivalent to the condition || Τὰ || = || x || for all x. In Hilbert space, 


Theorem 1. The following conditions on an operator T are equivalent: 
(a) T is isometric 


(b) T*T =I 
(ec) (Tx|Ty) = (ely) for all x,y € HK. 
Proof. 


(a) implies (Ὁ): By Theorem IV.7.4, || Tx || = || a || for all 2, hence 
(T*Tx|x) = (Tx|Tx) = || Tx |? = || x [5 = @lz) = (15). 

(Ὁ) implies (c): (Tx| Ty) = (T*Tx|y) = Ux|y) = (ey). 

(ὁ) implies (a): || Tx ||? = (Tx|Tx) = (x|x) = [ [1 [5.1 

If T is isometric, it is clear from the relation || Tx — Ty || = 
| 2 — y || that 7 is injective. 
Theorem 2. The range T(5C) of an isometric operator T is a closed 
linear subspace of SC. 
Proof. 

Τ (50) is a linear subspace, by Theorem IV.1.4. Suppose y is adherent 
to 7'(5C); the problem is to show that y ΕΞ Τ (50). Choose any sequence 


Yn € T(SC) such that yn > y. Say yn = Τὰν. Since || tm — Zn || = 
|| Tm — Tn || = || Ym — Yn || — 0, a, is a Cauchy sequence. Since 
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50 is complete, x, — Ζ for suitable z. By the continuity of Τ', Tz, - 
Tx, that is, Tz = limy,=y. J 

Example 1. Let 3 be classical Hilbert space, and 21,72,73,--- an 
orthonormal basis for 3¢. There is a unique operator 7 such that 
Tx, = X%+, for all k. Specifically, every vector z has a unique repre- 
sentation z = bg etx, and one defines 7 = pa Ante41. Clearly T 
is linear, and || Tz ||? = Σ, [δὲ \* = || z [3 shows that Τ' is isometric, 
T will be referred to as a one-sided shift operator. 

The effect of ΤῊ is as follows: ΤΣ] = 6, and T'*z,; = x,_, for k = 
2,3,4,-++. For, (Τ᾽ τι [2 = (a1|T2j) = (a1 |2j41) = 0 = (011) for 
j = 1,2,3,:+-. If & > 1, (Τ᾿ χε} τῇ = (rel 2541) = 1 when j = k — 1, 
and = 0 for all other 7; thus, (7'*2,|2j) = (,_;|2,) for7 = 1,2,3,-+-. 


Exercises 

1. If S and T are isometric operators, so is ST’. 

2. Operators S and T are said to be metrically equivalent in case 
|| Sx || = || Tx || for all ᾧ € HX. Then, S and T are metrically equiv- 
alent if and only if S*S = T*T. 

3. In Example 1, if 5¢ = 13, and 2, = e, is the canonical ortho- 
normal basis, describe the effect of 7’ and 7'* on the vector 1 = 
(A1,A2,A3; = "). 

4, Let Tx, = xp4. be the one-sided shift operator (Hxample 1). The 
x; are total, and for each k, Tx, is orthogonal to 2. Compare with 
Exercises 2.2, 2.4. 

5. Conditions (a) and (c) of Theorem 1 are equivalent, for a linear 
mapping in pre-Hilbert space. 

6. Theorem 2 holds for an isometric linear mapping of a Banach 
space. 

7. If St and δῖ are closed linear subspaces of 50, Τ' is an isometric 
operator, and 7(9) = 9, then Τίοπ ἢ) C ot". 


8. Suppose S and 7 are metrically equivalent operators, in the | 


sense of Hxercise 2. Prove: 

(i) There exists an isometric vector space isomorphism V: T'(5¢) — 
S(3c) such that V7'x = Sz for all x € σύ. 

(ii) If T(5C) is a closed linear subspace, then S(3C) is also closed. 
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9. Let 5: σὺ — σὺ be a mapping such that (Sx|Sy) = (2|y) for all 
z,y € 5C. Suppose, moreover, that S(5C) is a linear subspace of 3. 
Then S is an isometric operator. 


10. Let T: σὺ — σὺ be a mapping such that (Τὰ Ty) = (y|x) for 
all x,y € 20. Suppose, moreover, that 7'(5C) is a linear subspace of 50. 
Then ΤΊ is an isometric operator. 


11. Let 50 be classical Hilbert space, x, an orthonormal basis for 
ac. An operator 7 is isometric if and only if 7'x, is an orthonormal 
sequence. 


§5. UNITARY OPERATORS 


Definition 1. An operator T is said lo be unitary in case T*T = TT* 
=f, 


Theorem 1. The following conditions on an operator T are equivalent: 
(a) 17 ts unitary 
(Ὁ) T* ds unitary 
(c) T and T* are zsometric 
(d) T' is isometric and T* is injective 
(e) Τ᾽ 18 tsometric and surjective 
(f) T is bijective, and T-' = ΤῊ, 


Proof. 
The equivalence of (a),(b),(c) is clear from the definitions, and the 
relation 7'** = Τ᾿, 


(c) implies (d): Isometric operators are injective. 


(d) implies (e): By Theorem 4.2, T(5C) is a closed linear subspace 
of 3¢. Also, T* is injective; quoting Theorem V.8.6, = {0}+ = 
fy: Το = θ) = Τί 0) = T (3). 

(6) implies (f): By assumption, T is bijective. Let S = 7'—; accord- 
ing to Definition IV.2.2, ST = TS = I. Then, ΤῸ = T*] = T*(TS) 
= (T*T)S = IS = 8. 


(f) implies (a): Clear from the definitions. J 
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Examples 


1. Notation as in Theorem 3.1. Thus, 50 is classical Hilbert space, 
én is an orthonormal basis for 5C, u, is a bounded sequence of complex 
numbers, and 7' is the unique operator such that Τὰκ = yu, for all k, 
One has Τ ΤΩ, = | up|", = TT*zx,. Clearly, T is unitary if and only 
if | ux |’, = Izy for all k, in other words, [με] = 1 for all k. 

2. Let KH, Zn be as in Example 1. Define Yr = Vor+s for k = 
0,1,2,3,---, and y_, = tox for k = 1,2,3,---. Thus, y, is an ortho- 
normal basis indexed by the set of all integers. Every vector x has a 
unique representation ᾧ = > ” , \xyx (the sum can be interpreted as 
the unique vector y such that (y|y;) = ), for all k). There is a unique 


operator U such that Uy, = y+, for all k; specifically, one defines — 


U fol divs) = ees AxYe+1- [Ὁ follows that U*y, = yz_1 (see the 


calculation made in Lxample 4.1). Since U*Uy, = UU*y, = με = 
Iy,, U is unitary; it will be referred to as a two-sided shift operator. 


Exercises 
1. If σὺ is finite-dimensional, every isometric operator is unitary. 
2. An operator 7’ is unitary if and only if ΤῈ is isometric and T is 
injective. 
3. If S and T are unitary operators, so is ST. 


4. If T is a unitary operator, 511 and XN are closed linear subspaces, 
and T(t) = 9, then T(t") = x+. 


5. An operator 7’ is unitary if and only if 7: 50 -- σὺ is a Hilbert 
space isomorphism. More generally, if 3¢ and ® are Hilbert spaces, 8 
mapping 7’: σὺ — X is a Hilbert space isomorphism if and only if: 
T is a continuous linear mapping such that 7'*7 = I (= identity 
operator in 50) and T7'* = J (= identity operator in X). 

6. If 7':5¢C — Kis asurjective mapping such that (Τα Ty) = (| y) 
for all x,y € σύ, then Τ' is a unitary operator. 

7. IfJ and K are conjugations of 3, in the sense of Exercise V4.1, 
then JK is a unitary operator; in particular, JJ is unitary. If J is 8 
conjugation, and U,V are unitary operators, fien UJV is a conjuga- 
tion. 
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8. Let σύ be classical Hilbert space, x, an orthonormal basis for 50, 
and 1’,2’,3",- ++ any rearrangement of the indices 1,2,3,- - -. The unique 
operator 7’ such that Tx, = 2,’ for all’k, is unitary. ο 


9.-G) If 8 is a total subset of σῦ, and T is an isometric operator, 
then 7’ is unitary if and only if Τ᾽ (8) is a total subset of 520. 

(ii) If δ is classical Hilbert space, and zx, is an orthonormal basis 
for δῦ, an operator T is unitary if and only if Τα, is an orthonormal 
basis for 3c. 


10. An operator S is said to be unitarily equivalent to the operator 
T in case there exists a unitary operator U such that T = U*SU. 
Notation: S= T. Prove: 

(i) Τ =T, for every operator Τ'. 

(11) IfS2T, then Τ' 5Ξ 5. 

(ii) If R= Sand S = T, then R = T. Moreover, 
(iv) if S = 7, then S* = ΤῊ, 


§ 6. SELF-ADJOINT OPERATORS 
Definition 1. An operator T is said to be self-adjoint (or ‘‘Hermitian’’) 
in case T* = T. 
Theorem 1. The following conditions on an operator T are equivalent: 
(a) Τ' ds self-adjoint 
(Ὁ) (Tz|y) = (x| Ty) for all x,y € a 
(c) (Tx|xz) = (x|Tzx) forallze € 0 
(d) (T'x|zx) 18 real, for all x € 20. 
Proof. 
(a) implies (b): (Tx|y) = ΤῊ) = (e| Ty). 
(c) implies (d): (Tx|x)* = («| Tx) = (Τα χ). 
(d) implies (a): (Tx|z) = (Tx|x)* = (5 [ Τῷ} = (T*z|z). ἢ 


The next theorem follows easily from the properties of adjoints 
listed in § 1: 
Theorem 2. 

(1) If S and T are se “adjoint, so is S + T. 

(2) If T ts self-adjoint, and a is real, aT is self-adjoint. 
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(3) If T is any operator, T*T and T + T* are self-adjoint. 
(4) If Sand T are self-adjoint, then ST is self-adjoint if and only af 
ST = TS. 
Example 1. Notation as in Theorem 3.1. Thus, Tx, = wets and 
T*x, = p,*2,. Clearly T is self-adjoint if and only if ur, = oe* Te for 
all k, in other words, μ is real for allk. ἢ 
The decomposition of a complex number ἃ in the form A = a + 7, 
where a and β are real, has a generalization for operators: 


Theorem 3. (Cartesian form) If Τ' is any operator, there exist self-ad- 
joint operators A and B such that T = A + iB. Necessarily 


A=lor+T*) 


1 * 
B= ~(T — T*). 


Proof. 

Define A and B by the above formulas; clearly A and B are self- 
adjoint, and A + 7B = T. Suppose also 7’ = C + iD, with C and ἢ 
self-adjoint. Then, ΤῊ = C* + i*D* = C — iD, hence T + T* = 26 
and T — T* = 2D. Thus,C =AandD=B. ἢ 


The following result is important for the discussion of spectral 
theory in Chapter VIII: 


Theorem 4. If T is a self-adjoint operator, 
|| 7 || = LUB {|(Tz|z)|: || 4 }} < 1}. 


Proof. 
"Define o(x,y) = (Tx|y); ¢ is a bounded sesquilinear form (see T’heo- 
rem V.7.1), and || ¢|| = || 7’ ||. Since o(y,z) = (Ty|z) = (| Tx) = 


(Tx\y)* = [y(z,y)]*, ¢ is Hermitian, hence || ¢ || = LUB {| ¢(z,2)|: 
\| x || <1} by Theorem V.6.2. ἢ 


Exercises 


1. An operator T is said to be skew-adjoint in case T* = —T.Show: 
(i) T is skew-adjoint if and only if <7 is self-adjoint. 
(ii) For any operator 7, T — ΤΡ is skew-adjoint. 
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2. If T is a self-adjoint operator, $ and 5 are subsets of 50, and 
T(s) C3, then T(5+) c s+. 


3. If T is self-adjoint, Tx = θ if and only if TTz = 6. Give an 
example of an operator 7' such that T τέ 0 but TT = 0. 


4. If T is self-adjoint, and S is any operator, then S*T7S is self- 
adjoint. 


5. If T is self-adjoint, and S is unitarily equivalent to 7, then S is 
self-adjoint. 

6. (i) If φ is a bounded sesquilinear form on 50 X 50, there exist 
unique bounded Hermitian sesquilinear forms φὶ and g» such that 
e = φι + to. 

(ii) If 0 is a vector space, and ¢ is a sesquilinear form on U X Ὅ, 
there exist Hermitian sesquilinear forms ¢;,g2 such that φ = φῃ + 
1p. 

(in) Generalize (i) to bounded sesquilinear forms ¢ on ὃ X &, where 
δ is 8 normed space. 


7. An operator T is said to be positive in case (Tx|x) > 0 for all 
z€ 50, Notation: T > 0. Prove: 
Gi) If 7 > 0, then 7 is self-adjoint. 
Gi) IfS>Oand7>0, thnS+T7>0. 
(ii) If T > 0 and α > 0, then aT > 0. 
(iv) If 7 > 0 and S is any operator, then S*7S > 0. 
(v) For any operator 7, T*T > 0. 
(vi) IfS>0,7>0,andS+T =0,thenS = T= 0. 
(vii) If 7 > 0, then |(Tz|y)|? < (Τα) (ΤῊ) for all zy € 30. 
(viii) If 7 > 0, then Tz = θ if and only if (Tx|z) = 0. 
It can also be shown that: | | 
*(ix) If T > 0, there exists a unique operator αὶ > 0, such that 
T = SS; S is called the positive square root of Τ΄. Notation: S = νΊ. 
*(x) Let T > 0, S = ’ ν 7. For an operator R, RT = TR if and 
only if RS = SR. 
*(xi) IfS > 0, T > 0, then ST > Oif and only if ST = TS. In thi 
case, ΜΟΥ͂ = ναὶ νΎ. vi sitio 
*(xiil) If T is self-adjoint, there exist positive operators A and B 
such that Τ' = A — Band AB = 0. 


8. If S and T are self-adjoint operators, one writes S < Τ' (or 
T > S) in case 7' — S > 0 in the sense of Fzercise 7. Prove: 
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Ὁ. T < T for every self-adjoint operator T. — 

ii) IfS < TandT <S, thenS = 7. | 

(ἢ) WR<SandS<7,thnkR<T. . 

(iv) ΤΆ Κ' and 7 are self-adjoint, then S < T if and only if (Sz|z) < 
(Τὰ [1] for alla € i. 

(v) If Tisself-adjoint, T < || T || Z,and T > — || T'|| 1. Briefly, 
—\|T\I<S7T<|TI. 

(vi) ΠΑ <Band@ <D,thnA+C<B+D. 

(vii) If S < T and a > 0, then aS < αἴ. 

(viii) If S < 7, then —S > —T. 

(ix) ΤΠ < T, and R is any operator, then R*SR < R*TR. 


9, An operator Τ' is called a contraction in case || Tz || < || x || for 
all « € 50, For an operator 7’, the following conditions are equivalent: 
(a) T is a contraction | 
(b) || TI) <1 
(c) ΤῊΤ < IJ in the sense of Hzercise 8 
(ἃ) TT* <I 
(6) ΤῊ is a contraction 
(ἢ T*T is a contraction. 


10. If 7 is aself-adjoint operator, there isa smallest number ΤΊ, and — 


a largest number m, such that mI < T < MI in the sense of Exercise 
8. One has || T || = max {|m|, {Μ|}. 


11. If T is an operator such that 7*7 > J, then T(3C) is a closed 
linear subspace of 50. Such an operator is injective; it is not necessarily 
surjective. 


12. Let T be a self-adjoint operator, and set R = T + ἡ]. Prove: 

(i) || Ra |? = || Τῷ ||? + || = |P. 

(ii) RB is injective, || Rx || > || x |. 

(iii) R(5C) is a closed linear subspace of σύ. 

(iv) R(K) = K. 

(v) Ris bijective, R~! is continuous, and || R~ || < 1. 

(vi) The operator U = (Τ' — 2I)(T + i) is unitary; it is called 
the Cayley transform of T. 

13. (i) If T is any operator, then the operator R = ΤῊΤ +I is 
bijective, R~! is continuous, and || R~ || < 1. 

(ii) If T > 0, then the operator S = T +I is bijective, S~* is 
continuous, and || S~* || < 1. 
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14. Let Uy = yei1 be the two-sided shift operator, and P the 
projection whose range is the closed linear subspace generated by 
Yo,Y1,Y2,°**. Then, P > 0, UPU* < P, but UPU* # P. 


§7. PROJECTION OPERATORS 
Definition 1. An operator T is called a projection in case T* = T = TT. 


Thus, a projection operator is a self-adjoint operator which equals 
its ‘‘square.” 


Example 1. Let 9% be a closed linear subspace of 5, and 7’ = Px 
the projection of 5 on 9. As shown in Theorem 111.8.1, T is linear 
and || Tz || < || ||, hence 7 is an operator; also 771 = Tx and 
(Tx|y) = (x|Ty), thus T is a projection operator in the sense of 
Definition 1. In fact, every projection operator can be obtained in this 
way from a suitable closed linear subspace: 


Theorem 1. If Τ' is any projection operator, there is one and only one 
closed linear subspace It such that T = Py. Specifically, N is the range 
of T, and 9+ is the null space of T.. 


Proof. 

Let 9 = {y € 3: Ty = y}; since 91 is the null space of the operator 
T — I, it is a closed linear subspace of it. 

The range of T is MN. For, if ye R, then y = Ty € Τί); if 
conversely y © 7'(5), say y = Tz, then Ty = T(Tz) = Tx = y shows 
that y € 2. Thus, NW = ΤΩ. 

By Theorem V.8.6, the null space of 7 is [T'*(5¢)]* = [T(3e)]~ = 
τι Rae 

One has Τ = Py. For, given any x € σὺ, write z = y +2 with 
y & 9ῖ and z Ε N+; quoting Theorem 111.8.1, Pyx = y =y+0= 
Ty + Tz τε Τίῳ - 2) = Tz. 

Uniqueness: if Py, = Pon, then 9 = Py(5C) = Poy(H) = IM. Ι 


Example 2. Let y be a fixed vector of 50, || y || = 1, and let X = 
{\y: ἃ scalar} be the one-dimensional linear subspace generated by y. 
The operator 7 defined by Τὰ = (x|y)y is the projection of σύ on N. 
For, if z € &, one can write z = Ay + 2 with z 1 Mand ἃ a suitable 
scalar, and one has Tx = (Ay + 2\y)y = Aly) + Glyly = 
[A + Oly = Ay = Pye. 
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Example 3. Notation as in T’heorem 3.1. Let P; be the projection whose 
range is the one-dimensional linear subspace generated by ἄρ; by 
Example 2, Pyx = (x|xx)xx. Define T, = δ ἣν uP, forn = 1,2ώ,3,..... 

If x € 3, onehasz = Σ᾿), (α[πρ αν, hence Tz = Do” (x| ae) uate = 
Dd) με(Ρ μὴ) = lima DO) oe(Px) = lim, Τα. Thus, T,2 — Τα, for 
each z; that is, T, — T strongly, in the sense of Exercise IV .8.2. 
Theorem 2. Let I and I be closed linear subspaces of δῦ, P the projec- 
tion with range IU, and Q the projection with range N. Then, M L MW if 
and only if PQ = 0. 


Proof. 

If: Suppose PQ = 0. If σα Ε M and y € M, then (z|y) = (Px| Qy) 
= (| P*Qy) = (x| PQy) = (2/6) = 0. 

Only if: Suppose M L I. If x Ε 50, then Qz Ε 91, hence Οἱ € m+, 
hence P(Qz) = 6. ἢ 


Example 4. If P,,---,P, are projections, and P;P, = 0 whenever 
j #k, then P = Σὰ Ρ isa projection. For, P* = Σὰ Px* = Σὰ Pe 


= r. and PP = Bustin = Σὰ ΡᾺΡι = Σὰ Px = P, See also 
Exercise 6. 


Exercises 
1. An operator Τ' is a projection if and only if T = T*T. 
2. If P is a projection, and Q is unitarily equivalent to P, then Q is 
@ projection. 
3. If P is a projection, then P > 0 in the sense of Ezercise 6.7. If 
P is a non-zero projection, || P || = 1. 


4. If P, Q are projections, and PQ = QP, then PQ is a projection. 
In this case, if St is the range of P and XI is the range of Ὁ, then the 
range of PQ is the intersection of 51 and 9 (see Hxercise I.5.4). 


5. If P is a projection, with range 91, then J — P is a projection, 
with range 951. 


6. Let P and Q be projections, with ranges 9% and δῖ, respectively. 
If 51 1 2, then P + Q is the projection with range N@M (see 
Definition 111.3.8}. 
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7. If P, Q are projections, and PQ = QP, then P+ Q— PQisa 
projection. What is its range? 


8. If P, Ὁ, and P + Q are projections, necessarily PQ = 0. 


9. Let P and Q be projections, with ranges 91 and XN, respectively. 
The following conditions are equivalent: 

(a) P — Q is a projection. 

(b) P > Qin the sense of Fzercise 6.8. 

(c) ἢ Pz|| > || Qe for all 2 Ε΄ xe. 

(d) MDM, 
In this case, the range of P — Q is the set of all vectors common to ὃ 
and 9+. 


10. If P, Q are non-zero projections, and PQ = 0, then || P + Q || 
<|/PI+] Ql]. 


11. If T is an isometric operator, then T'7'* is a projection. 


12. An operator Τ' is said to be partially isometric in case T'*T' is a 
projection. Suppose 7’ is partially isometric, say 7'*7' = P. Prove: 

Gi) TP = T. 

(ii) Set Q = T7'*. Then Ὁ is a projection, and QT = T. Thus, ΤῊ 
is partially isometric. 

(iii) 7 maps Ῥίσ0) isometrically onto Q(5¢), [P(5¢)]* is the null 
space of 7’, and [Q(3¢)]* is the null space of Τ᾿". 

(iv) For an operator S, T'S = 0 if and only if PS = 0. Similarly, 
ST = 0 if and only if SQ = 0. 

(v) If U is unitary, and # is any projection, then UR and RU are 
partially isometric. If S is isometric and T is partially isometric, then 
ST is partially isometric. If S and 7' are partially isometric, ST is not 
necessarily partially isometric. 

13. Suppose 3¢ is finite-dimensional. 

(i) If T is partially isometric, and 7'7'* < 7*T, necessarily ΤΥ = 
ΤῸ 

(ii) If U is unitary, P is a projection, and UPU* < P, necessarily 
UP = PU. Compare with Ezercise 6.14. 


14, If T is an operator with one-dimensional range, there exist 
vectors y,z ΕΞ JC such that 7'x = (x|z)y for all x ΕΞ 3¢. Moreover, 
(i) T*x = (5 y)z for allz € σὺ 
(ii) TT = wT, μ a suitable scalar 
(11) || T |] = ly il lel 
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(iv) 7* = T if and only if y = az for some real number a. Equiv- 

alently, (y|z) = + || y |} || < |]. 

(v) What are necessary and sufficient conditions on y and z, in 
order that T be a projection? 


ἢ 8. NORMAL OPERATORS 
Definition 1. An operator T is said to be normal in case T*T = TT*, 
Every unitary operator is normal; so is every self-adjoint operator. 
An isometric operator is normal if and only if it is unitary. 
Theorem 1. The following conditions on an operator T are equivalent: 
(a) T is normal 
(Ὁ) ΤῊ ds normal 
(0) || T*x || = || Τῷ || for al « € “0. 


Proof. 

The equivalence of (a) and (b) results from ΤῊ" = Τ' 

(a) implies (6): || T*x ||? = (T*x|T*x) = (TT*z|z) = (T*Tz|z) 
= (7 Τα) = || Tz ||?. 

(c) implies (a): (T*Tx|z) = || Tx [13 = || T*x [3 = (TT*z|z) for 
alla. ἢ 
Theorem 2. Let T’ = A + 1B be the Cartesian form of the operator 7’. 
Then, T ts normal if and only if AB = ΒΑ. 
Proof. 

Only if: Assuming T normal, it is clear from the formulas in Theorem 
6.3 that AB = ΒΑ. : 

If: Suppose AB = BA;since ΤῈ = A — 7B, clearly T*T = TT* = 
AA+ BB. ἢ 


Exercises 


1. If T is normal, and S is unitarily equivalent to T, then S is 
normal. 
2. T is normal if and only if T and 7* are metrically equivalent. 


3. (i) If T is normal, and TT = 0, then T = 0. 
(ii) Give an example of an operator 7 + 0 such that TT = 0, 
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4. If T is normal, and u is a scalar, then || ΤῊΣ — y*z || = 
|| Τὰ — μα || for all z € 20. 


5. If 7 is normal, then 7' and ΤῊ have the same null space, and 


T(KH) = T*(K). If 91 is the null space of 7, then 91: = T(3¢). 
6. 


If 7’ is normal, there exists a unitary operator U such that T* = 
UT. 


7. More generally, if S and 7' are metrically equivalent normal 
operators, there exists a unitary operator U such that S = UT. 


8. Let 7 be a normal operator. Prove: 

(i) T is surjective if and only if ΤῊ is surjective. 
(1) T is injective if and only if ΤῈ is injective. 
(iii) T is bijective if and only if ΤῊ is bijective. 
(iv) If T is bijective, then 7*7— is unitary. 


9. An operator 7’ is said to be invertible if there exists an operator 
S such that S? = TS = J; in this case, it is clear that T is bijective, 
and Τ ἢ = S. Prove: 

(i) If S and 7 are invertible operators, then ST is invertible, and 
(GT) Fa Te, 

Gi) If T is an invertible operator, then so is ΤῸ, and (7'*)— = 
1 ἢν 

(i) If 7 is an invertible normal (resp. self-adjoint) operator, then 
T~* is normal (resp. self-adjoint). 

(iv) If T is an invertible operator, then T*T is invertible. The 
converse fails. 

(v) If T is normal, then T is invertible if and only if T*T is invert- 
ible. 

*(vi) It can be shown that every bijective operator T is invertible 
(that is, Τ᾿ ἢ is necessarily continuous). A proof of this is sketched in 
Exercises 11 through 18. 


*10. If 7 is a bijective normal operator, then TJ! is continuous 
(see also Exercise 13). A brief proof results from Exercise 8 and 
Exercise V8.6. 


11. An operator T is said to be bounded below in case there exists 
a constant Ν᾽ > 0 such that || Tz || > N || x || for all « € 3c. Equiv- 
alently, there exists a constant M > 0 such that 7*7 > MT in the 
sense of Wxercise 6.8. Prove: 
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(i) Every invertible operator is bounded below. 

(ii) If an operator Τ' is bounded below, then T is injective, ne 
T (5c) is a closed linear subspace of σῇ. Hence, the mapping 7’: # — 
T(3C) has a continuous inverse. 

(iii) An operator 7’ is invertible if and only if, Τ' is bounded below 
and 7'(5C) is dense in δῦ. 

(iv) An operator Τ' is invertible if and only if, T is bounded below 
and ΤῊ is injective. 

(v) An operator T is invertible if and only if, T is injective and T* 
is bounded below. 

(vi) An operator Τ' is invertible if and only if both 7 and 7* are 
bounded below. 


*12. If T is a surjective operator, then Τ᾽" is bounded below. 
*13. If T is a bijective operator, then Τ᾽ is continuous. 


14. Let T be an operator. 
(i) If 7*7' > J in the sense of Ezercise 6.8, T is not necessarily 
invertible. 
(ii) If T*T > 1 and TT* > I, then T is invertible. 
(iii) If Τ' is normal and 7*T > J, then T is invertible. 
(iv) If Τ' is normal and bounded below, then 7’ is invertible. 


15. An operator S is said to be similar to the operator T in case 
there exists an invertible operator A such that 7’ = A~'SA. Notation: 
S ~ T. Then: 

(i) T~T, for every operator T’. 

(ii) If S~ T, then T ~S. 

(iii) If R ~ S and Καὶ ~ T, then R ~ T. Moreover, 

(iv) If S~ T, then S* ~ ΤῊ, 

(v) If S=T in the sense of Fzercise 5.10, then S ὦ T. 

*(vi) If S and T are normal operators, it can be shown that S = T 
if and only if S ~ T. 


ὃ 9, INVARIANT AND REDUCING SUBSPACES 


Definition 1. A closed linear subspace It is said to be invariant under 
the operator T in case Τίσι) CN. 
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Examples 
1, {0} and & are invariant under every operator 7’. 


2. The null space of an operator T is invariant under Τ'; for, Tz = 
θ implies 7(Tx) = 8. 


3. More generally, if S and T are operators such that ST = TS, 
the null space of T is invariant under S. For, if Tz = 0, then T(Sz) = 
(TS)z = (ST)zx = S(Tz) = 39 = 9. 


Theorem 1. Let 8 and 5 be subsets of 50, Mm = 8.1 ὁ, mM = σ᾽". If T is 
an operator such that T(8) C 5, then T(M) C N. 


Proof. 

Incidentally, 9% (resp. 91) is the smallest closed linear subspace of 
0 which contains $ (resp. 3). By Theorem V.8.4, ΤῊ (σ 1) απ $+, hence 
T**(3++) c 51: a 


Examples 


4, Let ὃ be a subset of δ, 91 the smallest closed linear subspace 
containing 8. If T is an operator such that 7'(S) C 8, then 9 is invar- 
iant under 7’; this follows at once from Theorem 1. 


5. In particular, if 8 is a linear subspace of 50, 91 = 8, and T is an 
operator such that T(S) C 8, then 9 is invariant under 7’. See also 
Exercise 1. 


6. For any operator 7’, 1 (00) is invariant under 7’; this follows from 
the relation 717 (50}] C Τ (50), and Example 6. 


Theorem 2. Lei T be an operator, NM and MN closed linear subspaces of 
KH. The following conditions are equivalent: 


(a) TIM) CMW 
(Ὁ) Τ᾽ 919) Ὁ 9π᾽ 
(0) PxT Py = TP. 
Proof. 
(a) and (Ὁ) are equivalent by Theorem V.8.5. 
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(a) implies (c): Given any x ΕΞ 50, one has Poyx € IM (see Hxample 
7.1); by condition (a), TPsyz € MN, hence PyTPyyx = T Payer. 


(c) implies (a): If y ΕΞ mM, then Ty = ΤΡ = PxT Poy = Pry, 
hence Ty CN. ἢ 


In particular, 9t is invariant under T if and only if 9t~ is invariant 
under 7'*, 


Corollary. A closed linear subspace N is invariant under the operator T 
af and only tf Px TP = TP. 


Definition 2. Let σ᾽ be a closed linear subspace invariant under the 
operator T'. The restriction of T' to NX is the mapping T/N: NW — 91 
defined by (T/S)y = Ty. 


Thus, 7'/2t is the mapping y > Ty, restricted to 91. The following 
is elementary: 


Theorem 3. Lei 9 be a closed linear subspace of δ. 


(1) If 9 ἐξ invariant under the operator T, then Τὶ δὲ 18 an operator in 
the Hilbert space σι, and || Τ δὲ || < || ||. 


(2) If 9 is invariant under the operators S and T, then it is invariant 
under S + T, ST, and XT’; moreover, 


(S + T)/ = (S/H) + (T/H) 
(57) ῖ = (S/N)(T/N) 
(AT) δι = MT/N). 


Definition 3. A closed linear subspace It is said to reduce the operator 
T in case both δῖ. and M+ are invariant under T’. 


Theorem 4. If T is an operator, and δῖ is a closed linear subspace, the 
following conditions are equivalent: 


(a) 91 reduces T 

(Ὁ) 9τ reduces T 

(ec) 9 reduces ΤῸ 

(ἃ) 91s invariant under both T and ΤῊ 
(0) TP yx = PT. 
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Proof. 

The equivalence of (a)—(d) is immediate from T'heorem 2 and the 
definitions. Let P = Px. 

(ἃ) emplies (e): By the Corollary of Theorem 2, PTP = TP and 


PT*P = T*P. Then, PT = P*T = (T*P)* = (PT*P)* = PTP = 
TP. 


(e) implies (d): Assuming PT = TP, one has PTP = TPP = TP, 
hence Xi is invariant under 7' by the cited corollary. Also, ΡΤ = 
(TP)* = (PT)* = T*P, hence δῖ is invariant under 7*. J 


Examples 


7. If T is a normal operator, 91 = 1 (50) = T'*(5C) reduces T; for, 
%~ is the null space of both T and ΤΌ, hence is invariant under them 
both. 

8. If Tx, = 2,4, is the one-sided shift operator (Example 4.1), the 
only closed linear subspaces which reduce T' are {0} and 3c, For, sup- 
pose 9ῖ # {6} is a closed linear subspace which reduces 7’. For a non- 
zero vector y = δι Antz, define the index of y to be the smallest 
subscript ἐς such that δι # 0. Let m be the smallest index of any non- 
zero vector in 9t, and choose any non-zero y € 9 with index m; clearly 
y= es λχῦχκ. Necessarily m = 1; otherwise, 91 would contain the 
non-zero vector Τ᾽ ἣν = p ag λχῶχκ..1 = oes e412, contrary to the 
minimality of m. We may suppose \, = 1, thus y = 2; + poe AgCe. 
One has Ty = T*z, + Don MT*x, = 0+ DOS Auten, hence TT*y 
= Doo λιῖακα = Doo ete = y — ἄχ. Since x, = y — TT *y, clearly 
τι € 51. Then, % also contains Tx; = ro, ΤῊ = x3, and so on. 
Evidently % is total, ὃ: = [0], T= 20. ἢ 
Theorem 5. If St reduces T, then (7'/9t)* = ΤΉ "κι. 

Proof. 

Let αὶ = T/N and Καὶ = T*/2; these are both defined, by condition 
(d) of Theorem 4. For all x,y € 91, (R*x|y) = (z|Ry) = (x|Ty) = 
(7*x|y) = (Sz|y); since R* and S are operators in 1, R* = 5. ἢ 


Corollary 1. Jf % reduces T, and T is normal (resp. unitary), then T/M 
is normal (resp. unitary). 
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Proof. 
By Theorems 8 and 6, (Τ᾽ Τὴ οἵ = (T*/9)(T/N) = (T/N)*(T/N), 
and J/% is the identity operator of N. J 


Corollary 2. If T is self-adjoint, and It is invariant under T, then Τ δι 
ts self-adjoint. 
Proof. 

Since ΤῈ = Τ', St reduces Τ' by Theorem 4. Then, (7'/9)* = ΤῊ δ = 
T/N. ἢ 


Example 9. Suppose 7’ is normal, and 9 is invariant under 7’. Then, 
7 δι is normal if and only if 9t reduces 7. 

For, if 9t reduces 7’, Τ δα is normal by Corollary 1 above. 

Conversely, suppose Rk = 7/2 is normal, that is, R*R = RR*, 
where #* is an operator in 9t. Given z ΕΞ 9S, the problem is to show 
that T*rE 8. For any yOu, (7 2} ν) = (ἡ ΤῊ) = (2|Ry) = 
(R*z|y); thus, (ΤῊΣ — R*z|y) = Ofor ally € x, ΤΣ — νὰ Ε σι". 
Since R*z € I, one has 


| Pz |? = || (1 — R*z) + Rte |? = || T*z — R*z ||? + || R*z |)? 


by Theorem II.6.2. Since T and RF are normal, || ΤΣ || = || Tz || 
and || R*z || = || Rz || = |] Tx ||, hence the above equation reduces to 
|| Tx ||? = || T*e — πὰ ||? + || Tz [[3. Thus, ΤῊΣ — R*z = 6, ΤῊΣ = 
R*z € MN. 


Exercises 


1, Let ὃ and 5 be subsets of the metric spaces X and Y, respectively, 
and let 7': 80 — ‘YY be a continuous mapping such that 7'($) C 5. Then, 
Tis) Cc 5. 

2. If T is isometric, and δῖ is invariant under Τ', then Τ᾽ σ᾽ is 
isometric. 

3. If T has Cartesian form 7' = A + 7B, and 9 reduces 7, then 
ΤΙ has Cartesian form (A/9t) + 7(B/2). 

4. If 91 is invariant under 7’, and P is the projection with range XI, 
then (T/3t)* = (PT*P)/2. 

5. Let ὕψι = yx41 be the two-sided shift operator (Example 5.2), 


MX the smallest closed linear subspace containing y;,12,y3,- ++. Then 91 
is invariant under U, but does not reduce U. 
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6. If T is unitary, and is invariant under Τ', then 7'/2 is unitary 
if and only if Xt reduces 7’. 


7. If T is normal, δῖ is invariant under 7, and 7'/9 is a scalar 
operator, then 2% reduces 7’, 


8. Suppose Τ' is isometric, and δῖ is invariant under 7. Then: 
(i) If T(9t) = I, 91 reduces T. 
(ii) If Τ δῖ is normal, then 9t reduces T and Τ δὲ is unitary. 


9. Call an operator Τ' hyponormal in case |! 7'*z || < |! Τὰ || for all 
xz ΕΞ KX. Prove: 

(i) TY ishyponormal if and only if ΤΤ" < 7'*T in the sense of Fzer- 
cise 6.8. 

(ii) If T is isometric, then 7’ is hyponormal. 

(iii) If 7 is hyponormal, X is invariant under 7’, and 7'/9t is normal, 
then 9t reduces 7’. 


10. Let 9t be a closed linear subspace invariant under the operator 
T. Prove: 
(i) Πψ = Τῇ, || R*x || < || ΤῊΣ |] for alla € 0, 
(ii) If Τ' is hyponormal, so is 7/9. 


11. An operator 7 is hyponormal if and only if there exists an 
operator V, || V || 1, such that ΤῊ = VT. 


12. If T is an invertible hyponormal operator, then 7'~' is hypo- 
normal. 


13. Let Uyr = yr4, be the two-sided shift operator. Given a 
bounded sequence of complex numbers pn (n = 0,-£1,-42,---), let 
R be the unique operator such that Ry; = uxy, for all k. Define Τ' = 
RU. Then, T is hyponormal if and only if [με] < |ux4:| for all k. 
T is normal if and only if all the με lie on the circle in the complex 
plane with center 0 and radius || ; equivalently, 7 is a scalar multi- 
ple of a unitary operator. 


14, *(i) It can be shown that every hyponormal operator on a 
finite-dimensional Hilbert space is normal. 
(ii) If 7 is hyponormal, and % is a finite-dimensional subspace 
invariant under 7’, then 9t reduces 7’. 
15. Let 7 = A + 7B be the Cartesian form of the operator 7’. Let 


AB = C+ Ww be the Cartesian form of AB. Then, 7 is hyponormal 
if and only if D < 0. 
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16. Let Ta, = 24, be the one-sided shift operator. The only 
finite-dimensional subspace 9 invariant under T is 9t = {6}. 


17. If Tx; = x4. is the one-sided shift operator, there exists a 
linear subspace 8, distinct from {6} and 3, such that 7'($) C 8 and 
7T'*(S) < 8. Such a linear subspace is necessarily dense in 20, Among 
all such 8, there is a smallest, that is, one which is contained in every 
other. 


18. Suppose X% reduces the operator J. Then, 7 is an isometric 
operator if and only if both 7'/9t and 7/9t* are isometric. The word 
“Isometric” can be replaced by “unitary,” “normal,” “self-adjoint,” 
“projection,” “hyponormal,” “positive,” “invertible.” 

**19. Given an arbitrary operator 7’. It is not known whether there 
exists a closed linear subspace 9% invariant under 7, other than {6} 
and Jc. 


Proper Values VII 


§ 1. Proper vectors, proper values 


§ 2. Proper subspaces 
§ 3. Approximate proper values 


§1. PROPER VECTORS, PROPER VALUES 


Definition 1. A vector x € 5 1s said to be a proper vector for the opera- 
tor T in case: (1) x ¥ 0, and (2) Tx = μα for a suitable scalar μ. 


If also Tx = vx, then (u — v)z = 0, hence μ = ν. Thus, a proper 
vector x determines uniquely the associated scalar μ. 


Definition 2. A scalar μ is said to be a proper value for the operator T 
in case there exists a vector x τέ 6 such that Tx = μα. 


Thus, μ is a proper value for 7 if and only if the null space of 
T — pl is γέ {6}. 


Examples 


1. If T = pl, every non-zero vector of 3€ is a proper vector for 7; 
the only proper value is μ. 


2. Let P be the projection with range δῖ. Assume 81 ~ {6}, 51 τέ 50. 
If y is any non-zero vector of XN, Py = y = ly shows that y is a proper 
vector, with associated proper value 1. Similarly, every non-zero 
z € 9" is a proper vector, with associated proper value 0. The only 
proper values of P are 0 and 1. Every proper vector of P belongs either 
to 9ῖ or to 91. 


3. Let x be a proper vector for 7, say Τὰ = μα. If 91 is the null 
space of Τ' — ul, every non-zero vector of It is also a proper vector. 
In particular, || x ||~'z is a proper vector of norm 1. 
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4. Given a Hilbert space 3¢ of finite dimension 7, and scalars 
#1,°**,n (not necessarily distinct). If z,,---,2, is any basis of 0, 
there is a unique linear mapping Τ᾽: σὺ — SC such that Tz, = μχῦς 
for all k (see Theorem IV .1.3). See Exercise 1. 


Theorem 1. Lei μι be a proper value for T. Then: 
() lal <I TI. 
(2) If T ts self-adjoint, μ is real. 
(3) If T is isometric, [μ = 1. 


Proof. 
Let x be a non-zero vector such that Τὰ = ur; we may suppose 
| x || = 1. 


(1): [μ] = lee ll =| Tz] <I TI. 
(2): μ = p(x|z) = (μα 1) = (Τ 1 1) is real by Theorem VI.6.1. 
(3): [μ] = || Τὰ || =z) Ξ 1.8 


Example 5. Let 50 be classical Hilbert space, z, an orthonormal basis 
for δῦ, μη a bounded sequence of scalars, and 7' the unique operator 
such that 7'x; = up; for all k (see Theorem VI.3.1). Thus, the μὲ are 
proper values for 7’. In fact, these are the only proper values. For, 
suppose Τὰ = μα, αὶ # 0. Say 1 = oe Apt; then T'x = = AgMELE, 
hence Ague = (Τὰ 2x) = (ut| zu) = u(x| ze) = wrx, λκίμ — we) = O for 
all k. Since x + 0, there is an index m such that A, γέ 0; necessarily 
Lt — Hm = 0, thus μ = pm. Similarly, the μα are the proper values for 
7*; this is a symptom of normality: 


Theorem 2. If T is a normal operator, x is a vector, and yu is a scalar, 
then Tx = μα tf and only if T*x = p*x. In particular: 


(1) xis a proper vector for T if and only if it is a proper vector for T*. 
(2) » isa proper value for T if and only if u* is a proper value for T*. 


Proof. 

By assumption, 7*7' = TT'*. Since (T — μ᾽ = T* — pu *I, clearly 
T — ul isnormal, hence || (7 — μὴ || = || (7 — μὴ ἢ || by Theorem 
VI.8.1. ἢ 
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Examples 


6. The one-sided shift operator Tz, = z44, (see Example VI.4.1) 
has no proper values. For, assuming ΤῊ = μα, let us show that neces- 
sarily « = 6. If wu = 0, Tx = μὰ = Ox = 8; since T is injective, rc = θ. 
Suppose μ ¥ 0. Ifz = Diy Ante, then Τὰ = DOT Metegs = Dog 1%, 
hence 0 = (T'x/|z,) = (ux|x,) = μλι; thus A, = 0. Also, for k > 1, 
Ne—1 = (Tx| x4) = (ur | zx) = urs, hence Ay = w~Ag_1; since Ay = 0, 
this implies 4; = 0 for all k. 

7. If » is a proper value for the operator S, it does not follow that 
u* is a proper value for S*. For instance, let S = ΤῊ, where T is the 
one-sided shift operator. As shown in Example 6, S* has no proper 
values, but Sz; = Τ σι = 6 = Oz, shows that 0 is a proper value for 
5. 


8. There exist normal operators having no proper values. For ex- 
ample, let Uyz = yx, be the two-sided shift operator (see Hxample 
VI.6.2). Suppose Ur = pa, and assume to the contrary that 2 ¥ 0. 
Since U is isometric, |u| = 1. If z = 2 ee Acve, then 


Us = ae MYk41 = > Dee Ag—1Yk- 


For all k, λκ... = (Ux|yx) = u(z| yx) = wrx, hence |Ag_1| ΞΞ [λε[. It 
follows that |x| =|Ao| for all k; since x γέ 0, λο ¥ 0. But DY” | Ax/? 
< οὐ, a contradiction. 


Exercises 
1. Notation as in Example 4. T is an operator, and a complex 
number μ is a proper value for T if and only if » = μὰ for some k. 


2. Notation as in Example 5. If the μὰ are distinct, a non-zero vector 
x is a proper vector for Τ' if and only if z is a scalar multiple of some z,. 


3. Let Tx, = σκιὰ be the one-sided shift operator, and & the unique 
operator such that Ra, = (1/k)2z, for all k (k = 1,2,3,---). Then S = 
TR has no proper values. 


4 If Tx, = 2,4, is the one-sided shift operator, then μι is a proper 
value for Τ᾽ if and only if [μὲ < 1. 


δ. There exists a self-adjoint operator 7’ which has no proper values. 
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6. If T is a hyponormal operator, and Tz = μα, then ΤῊΣ = p*x. 


7. Let T be an operator. Then: 

(i) A non-zero vector x is a proper vector for 7 if and only if 
|(Tz|x)| = || Tx || || « [. 

(ii) In order that 7 have a proper value » such that |u| = || Τ' II, 
it is necessary and sufficient that there exist a vector x such that 
| z || = Land |(Tz]|zx)| = || T |. 

(iii) If a, is a sequence of scalars such that a, > 0 for all k, and 
p Tag 0,7 = 1, then Za Opps, < 1. 


8. Proper vectors and values are definable for a linear mapping of 
a vector space into itself. If Τ' is a continuous linear mapping in a 
normed space, and yu is a proper value for 7, then |u| < || T ||. 


ἢ 2. PROPER SUBSPACES 


Definition 1. /f 7' is an operator and yu is a scalar, the null space of the 
operator T' — ul is called the p-th proper subspace of 7', and is de- 
noted Sir(u). Thus, 


Wr(u) = [x € HN: Tr = pr}. 
Briefly, Nir(u) is the u-space of T. 


Thus, ϑύγίμ) is a closed linear subspace of 50; it is different from {6} 
if and only if μ is a proper value for 7’. A non-zero vector z is a proper 
vector for 7’ if and only if x belongs to some yu-space of 7’. 


Theorem 1. ΠῚ S and T are operators such that ST = TS, then the 
u-spaces of T are invariant under S; that is, 


SOlr(#)) © ϑύγ(μ) for all p. 


Proof. 
If « Ε Nr(u), then T(Sx) = (TS)z = (ST)z = S(Tx) = ϑίμτ) = 
u(Sx) shows that Sx € ϑυγσίῳ). J 


Corollary. The y-spaces of T are invariant under Τ', 
Theorem 2. Jf 7' is a normal operator, 

(1) the u-spaces of T reduce Τ'; 

2) συγ = Mre(u*); 

(3) Wr(u) L ϑ1γ(ν) whenever μ # v. 
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Proof. 

(1): Since T*7T = TT*, Nr(u) is invariant under T* by Theorem 1. 
By the Corollary, 37(u) is also invariant under Τ', hence 9tr(u) reduces 
T by Theorem VI.9.4. 

(2): See Theorem 1.2. 

(3): Given x € Sir(u), y ΕΞ Nr(v), and μ — ν γέ 0, the problem is 
to show that z L y. By (2), T*y = v*y. Then, (Tz|y) = (2| T'*y), 
(ux|y) = (x|v*y), μία [ψ) = v(xly), (μ — »)(w|y) = 0, hence (z]y) = 
0. 


Theorem 1 has a converse, provided Τ᾽ has “sufficiently many” 
proper vectors: 
Theorem 3. Suppose T' is an operator such that the only vector x which 
is orthogonal to every u-space of T is x = 6. Then, the following con- 
ditions on an operator SS are equivalent: 

(a) ST=TS 

(Ὁ) very u-space of T is invariant under 5. 
Proof. 


(b) zmplies (a): Let ὅ1. be the null space of TS — ST; the problem 
is to show that 91 = &, or, equivalently, that σι ὁ = {6}. 


One has ϑύγ(μ) C I; for, if « ΕΞ Wyr(z), by assumption Sx € MWy(u), 
hence 7'(Sz) = p(Sx) = S(ux) = S(Tx), (TS — ST)a = θ. It follows 
that if 2 .1 9, then x L ϑυγίμ) for all μ, hence x = 6 by the hypothesis 
on 7’. 


(a) implies (Ὁ) by Theorem 1. J 


The hypothesis on the u-spaces of JT in Theorem 3 is conveniently 
expressed in terms of the following: 


Definition 2. A family of closed linear subspaces is said to be total in 
case the only vector x which is orthogonal to every subspace Nt belonging 
to the family is x = 0. 

Thus, the condition on Τ' in Theorem 3 is that the u-spaces are a 
total family. It will be shown in § 4 of Chapter VIII that the u-spaces 
of any ‘“‘completely continuous” normal operator are a total family; 
this is the essence of the “spectral theorem”’ for such operators. 
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Exercises 


1. If Τ' is any operator, and P, is the projection whose range is 
Mr(u), then ΤΡ, = TP, = wP,. 


2. Let Tx, = μκακ, with notation as in Example 1.5. 

(i) Describe the u-spaces of 7’. 

(ii) If S is an operator, ST = TS if and only if every yu-space of T 
is invariant under S. 

(iii) If S is an operator such that ST = TS, then S*7T = TS*., 

(iv) If S is an operator, ST = TS if and only if the y-spaces of Τ' 
reduce S. : 


3. Suppose 7’ is a normal operator whose u-spaces are a total family. 
If S is an operator such that ST’ = 7S, then ST'* = T*S. 


4. If the u-spaces of an operator Τ' are a total family, and %p(u) C 
Ip*(u*) for all μ, then 7 is normal. 

5. Let T be a hyponormal operator. 

(i) For every scalar μ, Sir(u) C Mp*(u*). 

(ii) For a fixed scalar μ, let SX = ϑύγ(μ). Then, 91 reduces 7’, and 
Τ᾽ ΚΙ is normal. 

(iii) Mp(u) L Wy(v) whenever μ = ν. 

(iv) If the y-spaces of 7’ are a total family, then Τ' is normal. 


*6. If 3C is finite-dimensional, every hyponormal operator Τ' is 
normal. 

7. Definition 1 makes sense, and Theorem 1 is true, for linear map- 
pings in a vector space. For a continuous linear mapping in a normed 
space, the u-spaces are closed linear subspaces. 


8. Suppose 2;,---,%, are proper vectors for 7, Tx, = upzz. If 
μὰν" μη are distinct, then z,---,2, are linearly independent. 


9. If the u-spaces of 7’ reduce T, and are a total family, then T is 
normal. 


§3. APPROXIMATE PROPER VALUES 


A scalar » is a proper value for the operator 7' if and only if there 
exists a vector ὦ such that || x || = Land || Τὰ — μα || = 0. An operator 
may not have any proper values at all, even if it is normal (see 
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Example 1.8). A 1688 stringent condition on μ᾿ would be the following: 
given any «> 0, there exists a vector x such that || z|| = 1 and 
|| Τὰ — μα || < ε; equivalently, 


Definition 1. A scalar μι is said to be an approximate proper value for 
the operator T in case there exists a sequence of vectors x, such that 
| tn || = 1 and || Tr, — uty || — 0. Briefly, μι is an AP-value for T. 


Examples 
1. Every proper value for T is obviously also an AP-value. 


2. Let Tx; = ppt, with notation as in Hxample 1.5. Assume more- 
over that the sequence μὰ converges to a limit μ, distinct from every 
ux. As shown in Lzample 1.4, » is not a proper value for 7’; however, 
| Τὰς — wre || = || weve — wre || = || ee — μ)χκ || = [με — μί — 0, 
hence pu is an AP-value for 7’. 


3. Let » be an AP-value for Τ', and suppose || 7'z, — μας || — 0, 
where || zp || = 1. Then, |(T'x_|2%n) — μ] = [( πη 25} — (tal zn)| = 
|(T2n — utn|tn)| < {Τὰν — μαι || > 0, hence (7,22) — ἡ, 
ce — |u|. Clearly |u| < LUB {|(Tz/z)|: ||z|| <1} < 


Theorem 1. The following conditions on an operator T are equivalent: 
(a) T has an AP-value μι such that |p| = || T II. 
(Ὁ) LUB {|(Tz|z)|: || || <1} =| TIL. 

Proof. 


(a) implies (b): Suppose [μ] = || Τ' |, || en || = 1, and || Ta, — μας || 
— 0. As shown in Example 8, |(Txp|%n)| — |u| = || Tl. If 
M denotes the indicated LUB, one has || T || > M > |(Tx,|zn)| -- 
|| 7 || asm — οὐ, hence M = || T |]. 


(Ὁ) implies (a): Choose a sequence of vectors x, such that || z, || = 1 
and |(T'xn|2n)| — || T ||. Passing to a subsequence, we may assume 
(T'xn|%n) — pu for a suitable scalar μ, hence [μ| = || T ||. Then, 0 < 
| Τὰς — wan [15 = || Ten ||? — (Τα! μας) — (μα, Τὰς) + lu? = 
|| Τὰς |? — (Tan|utn) — (T2n|n%n)* + ||? < || T ||? — (Tn| μα) — 
(T2n|utn)* + lel? = |u|? — μ᾿ ΓΤ, τὴ) — w(T2n|an)* + |u|? > 
(ul? — μὴμ — μμ" + |u[? = Oasn — οὐ Thus, || Τα, — urn || -- 0, 
and » isan AP-value for 7. J 
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There exist self-adjoint operators having no proper values at all 
(see Exercise 1.5). However, it can be shown that every operator has 
at least one AP-value. For our purposes, the following result will 
suffice: 


Theorem 2. If T is a self-adjoint operator, either || T || or —|| T' || is an 
AP-value for T. 


Proof. 

Condition (Ὁ) of Theorem 1 holds for T, by Theorem VJI.6.4. Let 
(ΤᾺ, 1 1.) — pw, with notation as in the proof of Theorem 1. Since 
(T'%n|2%,) is real for all n, μὶ 15 real. Since |x| = || Z|], one has p? = 
(7. @ 


Exercises 


1. AP-values are definable for a linear mapping in a normed space. 
If T is a continuous linear mapping, and yu is an AP-value for Τ', then 
le] <7 I. 


2. If wis an AP-value for 7, then μ + is an AP-value for T + XJ, 
and Au is an AP-value for AT. 


3. If T is isometric, and » is an AP-value for 7’, then [μ = 1. 
4. If Τ' is self-adjoint, every AP-value for T is real. 


5. If Τ᾽ is normal, and » isan AP-value for 7’, then u* is an AP-value 
ἴον ΤῊ. 


6. Let T be a hyponormal operator. 

(i) If μ is an AP-value for 7’, then μὲ is an AP-value for T*. 

(ii) Let wand ν be AP-values for 7, μ τέ v. If 2, and y, are sequences 
of unit vectors such that || ΤΏ, — pr, || > Oand || ΤῊ, — vy || > 0, 
then (2n|yn) — 0. 


7. If wis an AP-value for the operator S, it does not follow that μὲ 
is an AP-value for S*. 


8. (i) w is an AP-value for 7 if and only if Τ' — yl fails to be 
bounded below in the sense of Exercise VI.8.11. 

(ii) If » is an AP-value for Τ', T — uJ cannot be invertible. 

(iii) If Tx, = a4, is the one-sided shift operator, then 
T = T — OJ is not invertible, but 0 is not an AP-value for 7’. 
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*9. It can be shown that every normal operator Τ' has an AP-value 
μ such that |u| = || T ||, hence |] 7 || = LUB {|(T2|z)|:|| 2 || < 1]. 
It would be nice to have a proof in the reverse order. 


10. Let T be an operator, μι a scalar. The following conditions are 
equivalent: 
(a) μ is an AP-value for T. 
(Ὁ) There exists a sequence of operators S, such that || S, || = 1 
and || (7 — μὴ, || — 0. 


*11. Exercise 10 can be generalized to normed spaces. 


12. The spectrum of an operator T is defined to be the set of all 
scalars » such that 7’ — ul fails to be invertible in the sense of Eer- 
cise VI.8.9; it will be denoted s(7’). Then: 

(i) If mis an AP-value for εἰ ne 53(1). 

(ii) μ (7) if and only if μὲ € (Τ᾽ ἢ. 

Gii) s(T +X) = {ut+A:n€ 8(T)}. 

(iv) sQT) = ἰλμ: μ € 8(T)}. 

(v) T is invertible if and only if 0 is not in s(T). 

(vi) If f is invertible, s(T—') = {μτὶ: μ € s(T)}. 

(vii) If 7 is self-adjoint, every μ € s(T) is real. 

(viii) If 7 is positive, every μ € s(T) is > 0. 

*(ix) It can be shown that for every operator 7’, s(7’') is a non-empty 
and closed set of complex numbers, and |x! < || Τ' || for all μ ΕΞ s(T). 

(x) If T is normal, μ € s(T’) if and only if » is an AP-value for 
Ψ' 

(xi) If 7 is unitary, then [μὲ = 1 for every » € s(T). 

*(xiil) If Tx, = x4, is the one-sided shift operator, 8(77 is the 
entire dise |u| < 1. 


*13. If A is any non-empty closed and bounded set of complex 
numbers, and 50 is classical Hilbert space, there exists a normal oper- 
ator 7᾽ such that (1) A is the spectrum of 7, and (2) μ € Aif and only 
if w is an AP-value for T. 
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§ 1. Completely continuous operators 

§ 2. An example 

§ 3. Proper values of CC-operators 

§ 4, Spectral theorem for a normal CC-operator 


§ 1. COMPLETELY CONTINUOUS OPERATORS 


If the u-spaces of an operator T are a total family, and S is any 
operator, ST’ = ΤΆ if and only if the u-spaces of 7’ are invariant under 
S; this shows the power of proper values. However, the u-spaces of 
an operator, even a self-adjoint one, may all be {6}; this shows their 
weakness. 

Every self-adjoint operator 7’ does have an AP-value μ, in fact, one 
for which |u| = || 7' ||. Happily, there is an important class of oper- 
ators for which AP-values are nearly always proper (see Theorem 3.2) ; 
these are the completely continuous operators: 


Definition 1. An operator Τ' is said to be completely continuous in case: 
given any sequence of vectors x, such that || %p || is bounded, Tx, has a 
convergent subsequence. Briefly, T is a CC-operator. 

Clearly, an operator Τ' is CC if and only if: || 2, || < 1 implies Tz, 
has a convergent subsequence. 


Examples 
1. The zero operator is CC. 
2. If y and z are fixed vectors, the operator Τ' defined by the formula 


Tx = (x|y)z is CC. For, suppose || zp || < 1. Since | (waly)| < || y Il, 
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(x,|y) has a convergent subsequence, say (tn, |y) — A; then T2p, 
(tn, |y)z — dz. 


3. If 5X is infinite-dimensional, the identity operator is not CC. For, 
if %1,%2,%3,--- is any orthonormal sequence (see Theorems 1.7.3 and 
11.6.2), || Itm — [τῳ [15 = || tm — Zn ||? = 2 whenever m τέ n, hence 
Iz, cannot have a convergent subsequence. ἢ 


The results of this section and the next show that CC-operators 
exist abundantly; § 3 is preliminary to § 4, in which normal CC-oper- 
ators are analyzed in detail. 


Theorem 1. If T' ts a CC-operator, and d is a scalar, XT is a CC-operator. 


Proof. 
If Tzn, — y, then (AT)ap, — Ay. J 


Theorem 2. Jf S and T are CC-operators, Καὶ +- T is a CC-operator. 


Proof. 

Given || 2, || < 1, the problem is to find a convergent subsequence 
of Sz, + Tz. Passing to a subsequence, we may suppose Sz, — wu. 
For a further subsequence, T'z,, — v; since also Sin, — 4, 


S+ T)t%,- utr. ἢ 


Definition 2. An operator T is said to be finite-dimensional if iis range 
T(3C) is a finite-dimensional linear subspace. 


Theorem 3. Hvery finite-dimensional operator T is CC, 


Proof. 

Let 2,---,2, be an orthonormal basis for T(5¢). For each z € 00, 
Tz = De Ϊκία)δε, where f;(x),---,fx(z) are scalars uniquely deter- 
mined by z; clearly f,(z) = (Tz|z%) = (x|T*z,). Define Tyr = 
Ϊκία)ξε = (ΟἹ T*2x)2,; T;, is a CC-operator by Example 2, hence T = 
> Ts is CC by Theorem 2. ἢ 


Theorem 4. /f T' is a CC-operator, and S is any operator, then ST and 
T'S are CC-operators. 


Proof. 

Given || z, || < 1. For a suitable subsequence, 7'z,, — u, hence 
STxn, — Su; thus, ST is CC. Also, |! Sz, || < || S ||, hence T(Sz,,) has 
a convergent subsequence; thus, TSis CC. J 
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Theorem 5. If 7’, is ἃ sequence of CC-operators, T is an operator, and 
|| 7 — Τὶ || — 0, then T is a CC-operator. 


Proof. (“diagonal procedure’’) 

Given || x, || < 1, the problem is to find a convergent subsequence 
of Τά... 

Let x} be a subsequence of x, for which 7:2, is convergent. Say, 


Let x? be a subsequence of x, for which Tox? is convergent. Say, 
Tot πω. | 

Continuing inductively, one obtains, for each k, a subsequence σῇ 
of αἰ such that 7.2, — Ux. 

The x may be arranged in a rectangular array, with αἵ τὸ, αβ," τ τ as 
the k’th row. Then, the k’th row is by construction a subsequence of 
the (k — 1)’th row. Consider the “diagonal sequence” αἱ ah χϑ ὁ οὶ 
For each k, the sequence αὐ τὶ, 3," - - is clearly a subsequence of 
the k’th row, hence 


lim T,2% = uz for k = 1,2,3,:-:. 
n—0 
We assert that Τὰν is a Cauchy sequence. For, given any « > 0, 

fix any index k such that || 7 — 7; || <.«. For all m and π, 
| Ταῖς — Ταῦ || < || (1 — Τι)αῖ || + || Term — Teas || + || (Te — Tan ll 
< e+ || Tee — Tyr ||. Since Τ τη — uz, as n — οὐ, there is an 
index N such that || 7.2% — Τ καῖ || < « whenever m,n > N. Then, 
|| Tz™ — Τὰν || < 3e whenever m,n > N. This shows that the sequence 
Tx" is Cauchy; since σὺ is complete, it is convergent. J 


Corollary. Jf Τὶ, is a sequence of finite-dimensional operators, T 1s an 
operator, and || T,, — T || — 0, then T is a CC-operator. 


Proof. ; 
Each 7’, is a CC-operator by Theorem 3, hence T is CC by Theo- 
rem ὃ. ἢ 


Exercises 


1. Let T be an operator, ὃῖ a closed linear subspace of 5. 

(i) If 9 is invariant under 7’, and T is CC, then Τ᾽ δὰ is CC. 

(ii) If 9 reduces 7’, T is CC if and only if both Τ᾽ δὲ and T/x* are 
CC. 


2. If 20 is finite-dimensional, every linear mapping T: 5% — σῷ 158 
CC-operator. 
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3. (i) Τῇ S and Τ' are metrically equivalent operators, and T is CC, 
then S is CC, 

(ii) In particular, if T is normal and CC, then ΤῸ is CC. See also 
Exercise 6. 


4. If T isa CC-operator, and S*S < T*T’, then S is a CC-operator. 
In particular, if 7’ is a hyponormal CC-operator, ΤῈ is CC. See also 
Exercise 6. 


5. If 50 is infinite-dimensional, a CC-operator cannot be invertible. 


6. (i) An operator T is CC if and only if 7*T' is CC. 
(ii) If 7 is a CC-operator, then T'* is CC. 


7. (i) If σὺ is infinite-dimensional, and the operator T is bounded 
below, then 7’ cannot be CC. 
ἘΠῚ) If 70 is infinite-dimensional, a CC-operator cannot be surjective. 


8. (i) If y and z are fixed vectors, and y + θ, there exists a CC- 
operator Τ' such that Ty = z. 

(ii) More generally, the set of CC-operators is n-fold transitive in 
the following sense: if y,---,%/n are linearly independent, and Ζ:," " -,2, 
are arbitrary, there exists a CC-operator 7' such that Ty; = 2; for all 
k. 


9. If ἃ is an algebra, an ideal of @ is a linear subspace 4 such that: 
ifz Ε g,anda € @is arbitrary, both az ΕΞ g and za € J. Evidently, 
an ideal of @ is a subalgebra. The results of this section can be sum- 
marized as follows: 

(i) The CC-operators form a closed ideal £,.(5¢) of the Banach 
algebra £,.(5C), and this ideal contains every finite-dimensional oper- 
ator. 

(ii) «“,ε(σ0) is itself a Banach algebra. 

(iii) One has J € S&,¢(3¢) if and only if 3¢ is finite-dimensional, and 
in this case £,,(5C) = £,(5C). 

(iv) In view of Lzercise 6, £.-(5C) is a *-subalgebra of £,(5), hence 
is a B*-algebra. 

10. Notation as in Erercise V.8.11, with K = 50 a classical Hilbert 
space. Prove: 

(i) The *-algebra £),,(5C,5C) of operators of Hilbert-Schmidt class is 
an ideal of £,(5C). 

(ii) Every finite-dimensional operator is of Hilbert-Schmidt class. 
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*11. If 9 is an ideal of £,(3¢), different from {0}, then 9 contains 
every finite-dimensional operator. 


12. If ὃ and § are normed spaces, complete continuity is definable 
for a linear mapping 7: ὃ — σ. If T is CC, it is necessarily contin- 
uous. 


13. Let ὃ and δ᾽ be normed spaces, and denote by £,-(8,) the set 
of all CC linear mappings 7: ὃ — σι Then: 
(i) Lec(&,F) is a linear subspace of £,(&,5). 
(ii) £--(&,8) is an ideal in the algebra £,(8), hence is a normed 
algebra. 
(iii) If δ is a Banach space, £,¢(&,5) is a closed linear subspace of 
the Banach space £,(&,5), hence is itself a Banach space. 
(iv) If 5 isa Banach space, £,-(¥,5) is a closed ideal of the Banach 
algebra £,(%), hence is itself a Banach algebra. 
*(v) If T: ὃ — Fis a continuous linear mapping such that 7(8) is 
finite-dimensional, then T is CC. 
*(vi) It can be shown that the identity mapping J: 8 — 8 is CC 
if and only if ὃ is finite~dimensional. In this case, every linear mapping 
T:& — &is CC. 


14. If ὃ and § are normed spaces, and ὃ is finite-dimensional, then 
every linear mapping 7: ὃ -- § is CC. 


15. Let ὃ be a normed space, and St a dense linear subspace of 8. 
Prove: 

(i) If € &, there exists a sequence y, € 9 such that y, — 2, and 
ll yn || = |] x || for all n. 

(ii) Let ὦ be a Banach space, S: 51 — @ a continuous linear map- 
ping. If 7’: ὃ — @ is the unique continuous linear mapping such that 
Ty = Sy for all y € 91 (see Theorem IV.7.6), then T is CC if and only 
if S is CC. 


*16. If ὃ is a normed space, ὦ is a Banach space, and 7:8 > @is 
a continuous linear mapping, it can be shown that 7’ is CC if and only 
if T’ is CC. 


17. If ὃ and & are normed spaces, consider the set of all linear map- 
pings Τ᾽: ὃ — δ᾽ satisfying the following condition: if z, € 8, || x, || < 1, 
Tz, has a Cauchy subsequence. Examine the theorems of this sec- 
tion, with complete continuity replaced by this condition, 
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§2. AN EXAMPLE 


Let 50 be classical Hilbert space, z, an orthonormal basis for 3C, up 
a bounded sequence of scalars, and 7’ the unique operator such that 
Τὰς = μκῦκ for all k (see Theorem VI.3.1). If Px is the projection 
whose range is the one-dimensional subspace generated by ἀρ, and 
Τ᾽, τῷ p Ἄδα uxP,, one has 7,2 — Tx for each x € HX (see Example 
VI.7.3). It will be shown in § 4 that if T is CC, necessarily p, — 0; 
we are concerned here with the converse: 

Theorem 1. If un — 0, then || T — Τὶ || — 0 and T is completely 
continuous. 
Proof. 

The range of 7᾽, is contained in the n-dimensional subspace gen- 
erated by 21,-~--,%n, hence 7, is a finite-dimensional operator. In view 
of the Corollary of Theorem 1.5, it will suffice to show that 

Given e > 0, let NV be an index such that [με] < «wheneverk > N. 
Fix an index n > N; it will be shown that || 7 — Τὶ || < ε. For any 
vector z = ie λχῖε, one has Τ',Ζ = hve λεμκῶχ, hence || Tz — T'yx ||? 
= || Do λέμε — Σὰ Namaste ||? = |] DOP, Memesre [2 = DOP | ee |? 
SO iyi? SP Dye? = ἐ || x ||?. Thus, || (7 — 7,)z || < 
e-|| x ||; since 2 is arbitrary, || Τ' —T7,||}<« ἢ 


Exercises 

1. Let 5 be classical Hilbert space, z, an orthonormal basis for 3, 
and (a;,) an “infinite matrix” of scalars (7 = 1,2,3,---;k = 1,2,3,---) 
such that )’,,|ajx|” <0. Then, there exists one and only one op- 
erator 7’ such that Tx; = 5 a a;,2; for all k; this operator is CC. Every 
operator of Hilbert-Schmidt class is completely continuous. 

2. If i is classical Hilbert space, z, is an orthonormal basis for 3¢, 
and §;, are scalars such that Sa | Byte! < οὐ, there exists a unique 


operator 7’ whose matrix is (8;,) relative to the given basis. This op- 
erator is of Hilbert-Schmidt class, hence is CC. 
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83, PROPER VALUES OF CC-OPERATORS 


Recall that if 7 is an operator and u is a scalar, 9t7(u) denotes the 
u-space of 7’; it is the closed linear subspace {x € SC: Tx = pa}, that 
is, the null space of the operator T' — zl. 


Theorem 1. If 7 is a CC-operator, and p # 0, then ϑύτ(μ) 18 finite- 
dimensional, 
Proof. 

Assume to the contrary; by Theorem 1.7.8, Sir(u) contains a linearly 
independent sequence 2,. By Theorem II.6.4, we may suppose the z, 
are orthonormal. If mn, || Ttm— Τὰς ||? = || etm — Hen ||? = 
|u|? || em — xn ||? = 2 |u|? > 0, hence Tz, cannot have a convergent 
subsequence; since || 2, || = 1, this is contrary to the complete con- 
tinuity of T. J 
Definition 1. Suppose u is a proper value for the operator T. If Nr(u) 
has finite dimension n, μ is said to be a proper value of finite mullti- 
plicity n. If ϑυγίμ) is infinite-dimensional, μ is sard to have infinite 
multiplicity. If μι is not a proper value for T, that is, if Nr(u) = {6}, 
it is convenient to say that μι is a “‘proper value of multiplicity zero.” 


In this terminology, Theorem 1 asserts that every non-zero proper 
value of a CC-operator has finite multiplicity. This result is not always 
helpful, for there exist CC-operators having no proper values at all: 


Example 1. Let 7x; = 2,4, be the one-sided shift operator, A the 
unique operator such that Ra, = (1/k)2, for all k, and let S = TR. 
Since R is CC by Theorem 2.1, S is CC by Theorem 1.4; however, S has 
no proper values (see Exercise VII.1.3). ἢ 


Every normal CC-operator does have at least one proper value, as 
is shown in Theorem 8 below. This result is expedited by the following 


Theorem 2. If T is a CC-operator, and μ is a non-zero approximate 
proper value for Τ', then μ is a proper value. 


Proof. 

By assumption, there exists a sequence of vectors x, such that 
| z, || = land || Tz, — utp || — 0. Since T is CC, we may suppose, 
after passing to a subsequence, that T'z, is convergent, say Tr, — τι. 
Since || uw — urn || < || uw — Ten || + || Ten — u%n || — 0, one has 
ut, -- u. Then, T(ut,) > Tu, thus Tu = lim T(ur,) = lim p(T2n) 
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=u lim Τῶν = ww, Since {αὶ || = lim || μα [| = lim |u| = [μ] 0, 
u is a proper vector. ἢ 


Lemma 1. // S is any self-adjoint CC-operator, S has a proper value μ 
such that |u| = || S|]. 


Proof. 

If S=0, » = 0 is a proper value (we are assuming 50 γέ {6}). If 
S #0, then by Theorem VII.3.2, S has an approximate proper value 
μι such that |u| = || S || > 0; quote Theorem 2. ἢ 


Lemma 2. Jf Ὁ is finite-dimensional, every normal operator T has at 
least one proper value. 


Proof. 

Every operator in 5 is CC, by Theorem 1.3. Let T = A +B be the 
Cartesian form of 7’ (see Theorem VI.6.3). By Lemma 1, there is a 
(real) scalar a such that the proper subspace % = 94(a) is ¥ {6}. 
Since 7' is normal, AB = BA (see Theorem VI.8.2), hence 9 is in- 
een lonege B (see Theorem VII.2.1). Let R = B/N be the restric- 
tion of αὶ to I; # is a self-adjoint operator in 9% (see Corollary 2 of 
Theorem VI.9.5). By Lemma 1, there exists a scalar 6, and nia 
vector x € 2 such that kz = Bz, that is, Br = Bx. By the definition 
of N, Ax = ax, hence Tx = Ax + iB = (a + ἴβ)α. Ι | 


Theorem 3. Every normal CC-operator T has at least one proper value; 
tn fact, there exists a proper value μι such that |u| = || T ||. 
Proof. 

Let S = T*T; S is self-adjoint, and is CC by Theorem 1.4. We may 
suppose 7’ + 0, hence || S || > 0. 

_ By the definition of || T ||, there is a sequence of vectors 2», such 
that || x, || = 1 and || Tz, || — || T ||. Then, || S$ || = || T*7 || = 
| Τ |? = lim || Τὰ, ||? = lim (ΤΈΤΕ, [ἀη) = lim (Seq |29). 

Passing to a subsequence, we may suppose, by the complete con- 
tinuity of 7, that Tx, — y for a suitable vector y. 

_ Let a = ||S||. We assert that a is a proper value for S. For, 
|| San — atn||? = || Sty ||? — 2a(Satp | Xn) + a? < a? — 2a(Stq|%n) +a? i 
— 2a? + a? = 0, thus St, — at, — 8. Since Tz, — y, one has 
St, = ΤΉ Τα.) — T*y, hence at, = (at, — Stn) + Stn - 6 + 
T*y. Thus, τη — a~'(T*y). Setting z = a—!(T*y), one has σης — z 
|| 2 || = lim || 2, || = lim 1 = 1, and Sz = lim Sz, = T*y = az. Thus, 


Ζ is a proper vector for S, with associated proper value «. [One could 
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also have quoted Lemma 1, since S > 0; see the remarks following 
this theorem.] 

Let NX = MNg(a); it has just been shown that δ᾽ ~ {6}. Since a > 0, 
91 is finite-dimensional by Theorem 1. Since T is normal, clearly TS = 
ST and T*S = ST*, hence % reduces Τ' (see Theorems VII.2.1 and 
VI.9.4). Let R = T/2; R is a normal operator in 9t (see Corollary 1 
of Theorem VI.9.5). By Lemma 2, there exists a scalar μ, and a vector 
2 € 0, such that || x || = 1 and Ra = uz, that is, Tz = μα. By T heo- 
rem VII.1.2, ΤῊΣ = p*x, hence Sx = ΤῊ ΤΩ = |u|?a; since Sx = ax 
by the definition of 9t, |u|? = a. Thus, |u|? = || S || = || 7 Pr ἢ 


Remarks on Theorem 3 


Suppose 7’ is a normal CC-operator. Since || T*2m — T*tp || = 
| T*(m — tn) || = || Tm — σοὶ ἢ} = || Tm — Tn ||, it is clear that 
T* is also CC. Let T = A + 4B be the Cartesian form; it is clear from 
the formulas in Theorem VI.6.3 that A and B are CC-operators (see 
Theorems 1.1 and 1.2). It follows that the proof of Lemma 2 is valid 
for T without the assumption of finite-dimensionality: T has at least 
one proper value. 

Why mention Lemma 2 at all? The point is that the existence of 
proper values for a linear mapping in finite-dimensional space is an 
essentially algebraic fact (see Exercise 5) which may be known to the 
reader from other contexts. Granted Lemma 2, one way or another, 
the proof of Theorem 3 derives directly from Lemma 2, and one need 
not quote Theorem 2 and Lemma 1; in this way, the concept of AP- 
value is circumvented, thus emphasizing the algebraic aspects of 
Theorem 8 and the spectral theorem to follow. 


Exercises 


1. (i) If T is a normal CC-operator, there exists a vector ὦ, ||| = 
1, such that |(7'z|2x)| = || T ||. Compare this with Exercise VII.1.7. 

*(ii) It ean be shown that for every normal operator 7, || 7 | 
LUB {|(Tz|z)|: || |] = 1}. 

2. Notation as in the proof of Theorem 3. Then, 91 contains all 
vectors x such that Τὰ = dx for some scalar ἃ with |A| = || 7’ ||. One 
has || Τὰ || = || 7'|| || || for every x Ε. %. 

3. Let Rx. = (1/k)ax, as in Example 1. Risa self-adjoint CC-oper- 
ator, 0 is an AP-value for R, but 0 is not a proper value. 


Vill §4 Completely Continuous Operators 181 


*4. It can be shown that every operator T has at least one AP-value 
u. If moreover 7’ is CC and μ # 0, T has a proper value. 


"δ. It can be shown that if 20 is finite-dimensi 
| ensional, every tor 
T has at least one proper value. dal 


*6. Theorem 1 holds fora CC linear mapping 77: 8 — &, where & is 
a normed space. | 


*7. In the wake of a result such as Theorem 2, the reader should be 
warned of the existence of a stubborn class of operators whose only 
AP-values are 0. Such an operator is called a generalized nilpotent. It 
can be shown that the following conditions on an operator 7’ are 
equivalent: 

(a) T is a generalized nilpotent. 
(b) The spectrum of T is {0}. 
ει lim || 7 ||!/" = 0. 

e operators 7” are defined inductively by the form 1 = 
re = cana ce T is called a saw naan ee: naa hi a 
for some π.] It can be shown that th or i i 
ta ee the only normal generalized nil- 


8. The operator S of Example 1 is a generalized nilpotent. 


§4. SPECTRAL THEOREM FOR A NORMAL CC-OPERATOR 
Throughout this section, 7’ is a fixed normal CC-operator. 


Theorem 1. (Spectral Theorem) T’he proper subspaces of T are 
Sper rem) The : a total 
family. That is, if x L Ny(u) for every scalar μ, necessarily x = 0. 


Proof. 
ni : be eee subset of 5C such that ϑύγ(μ) C § for all μ; 
clearly ὃ = {x € δῦ: 2 ΕΞ ϑυγίμ) for some μῇ. Let δῖ = $+; the m 
is to show that 2 = {6}. , nian 
Since each Wip(u) reduces Τ' (Theorem VII.2.2), T(S) ΓΞ 8 and T* 
2.2), | : 8) 
C 8; by Theorem V.8.4, T*(9t) C 51 and T(9) C dt, thus οἵ . 
T (see Theorem ΤΊ ὃ, Ὁ. ae 
Assume to the contrary that 9 ~ {0}, and let R= Τα; Risa 
normal operator in Nt (see Corollary 1 of Theorem VI.9.5). We assert 
that R is a CC-operator in the Hilbert space %. For, if a, € σἵ 
and || 2, || <1, Ra, = Tz, has a convergent subsequence, say 
χη, — Ε 5; since X is closed, u C N. 
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Thus, FR is a normal CC-operator in the Hilbert space 1 ~ {6}. By 
Theorem 3.3, there exists a non-zero vector « € 9, and a scalar μ, such 
that Rx = μα, thatis, Tx = μα. Then, « ΕΞ ϑυγ(μ) C 8; butz C 51 = 
$+, hence x L x, x = θ, a contradiction. J 

Let P(u) denote the projection with range 9tr7(u). [More precisely, 
P(u) = Pr(u); thus, Pr+(u) = Pr(u*).] The following result 1s essen- 
tially an application of the spectral theorem: 


Theorem 2. The following conditions on an operator S are equivalent: 


(a) ST=TS 
(Ὁ) The p-spaces of T are invariant under S. 
(6) ST* = T*S 


(d) The p-spaces of T reduce 5. 

(e) SP(u) = P(u)S for all μ. 
Proof. 

(a) implies (Ὁ): See Theorem VII.2.1. 

(Ὁ) implies (c): Since Itr+(u) = Ir(u*), the u-spaces of 7* are in- 
variant under S; since they are a total family by the spectral theorem, 
ST* = ΤῊΝ by Theorem VIJ.2.3. 

(c) implies (ἃ): The y-spaces of ΤῈ are invariant under S by Theorem 
V1J.2.1; in other words, the u-spaces of T are invariant under S. Since 
S*T = 7'S*, the u-spaces of 7 are also invariant under S*. 

(d) and (e) are equivalent by Theorem VI.9.4. 

(d) implies (a) by Theorem VII.2.3 and the spectral theorem. J 


The next results will show that the proper values of 7’ can be 
enumerated in a (finite or infinite) sequence. 


Theorem 3. For each « > 0, the annulus ε < |A| < || T || contains at 
most finitely many proper values of T. Every proper value of Τ᾿ lies in 
the disc || < || T ||; that is, Nr(A) = {8} whenever |d| > || T |]. 


Proof. 

If » is a proper value of T, |u| < || 7 || was shown in Theorem 
VIJ.1.1; this does not require any special hypotheses on 7’, 

Given « > 0, assume to the contrary that there exists an infinite 
sequence μα of distinct proper values of 7’ such that ε < [μι] < || T |). 
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Passing to a subsequence, we may assume μη, —> μι where e < |u| < 
| T ||. Let Tan = untn, || tn || = 1. Again passing to a subsequence; 
Wwe may assume Tx, — y for a suitable vector y. Thus, unt, — y; 
since wn’ — w', one has ζη = un" (untn) — wy, hence 
|| tm — tn || — Oasm,n — ©, Butz, is an orthonormal sequence by 
Theorem VII.2.2, hence || tm — σὰ ||? = 2 whenever m τέ n, a con- 
tradiction. J 

Let us first dispose of a special case: 
Theorem 4. The following conditions on T are equivalent: 

(a) T is finite-dimensional. 

(Ὁ) T has only finitely many distinct proper values μ1," - + ,ttn.- 
In this case, T = D0 xP (ux). 


Proof. 

(a) implies (b): Assume to the contrary there is an infinite sequence 
141,H2,ég,*** Of distinct proper values. Let Τὰς = μα, || xz || = 1; the 
sequence of vectors x, is orthonormal, hence linearly independent. 
Since at most one of the μα is 0, τρ = T'(u,—'a,) ΕΞ T(5C) for the re- 
maining ᾿ς, hence 7'(3C) is infinite-dimensional. 

(b) implies (a): Let I be the smallest linear subspace containing 
Sp (4), + +, Ir(u,). Evidently, I is the set of all vectors x of the form 
[= he Ye, with yz ΕΞ Wr(ux) (see Theorem I.5.1). Since the Np(uz) 
are mutually orthogonal, it is appropriate to write I = ϑυγίμι) - -Φ 
Sir(un), in analogy with Definition IIJ.2.5. We assert that IN = 50; 
since SW is closed (see the Corollary of Theorem III.4.2), it will suffice to 
show that m+ = {6}. Indeed, let 


ὃ = {x Ε 50:2 € Myp(u) forsome pz}; 
clearly m+ = 5. But $+ = {6} was shown in the proof of Theorem 1. 
Thus, given any vector « € 5, one can write x = D>) με, with 
Ye & Ny (ux). fj γέ k, then yz € [Str(uj)]*, hence P(u;)y, = 0; it fol- 
lows that P(u;)¢ = La P(u;)ye = yj, thus «= 2 Ρίμι)Σ. This 
shows that >, Ρίμι) = I. Also, 


Τὰ = Le Ty, = ΩΝ MEY: = pe μι" (ux), 


hence Τ' = pes ueP (ur). 
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Incidentally, if T is also injective, 50 is necessarily finite-dimen- 
sional. ἢ 

Assume henceforth that T is not finite-dimensional; that is, T has 
infinitely many proper values. 


Theorem 5. Assuming T is nol a finite-dimensional operator, 


(1) the distinct non-zero proper values of T can be enumerated in 

an infinile sequence f1,m2,u3,*** ; 

(2) necessarily un — 0; 

(3) one can arrange to have || = |2| 2 [με] Ὁ "5: 

Proof. 

(1): For each n = 1,2,3,-++, let A, denote the set (possibly empty) 
of all proper values μ᾿ such that 1/n < |u| < || T ||. By Theorem 3, 
each A, contains at most finitely many scalars, and every non-zero 
proper value belongs to some An. Since Ay C Ae C ---, the non-zero 
proper values can be enumerated in a sequence by first writing down 
those in A;, those in Ag not in A, and so on, 


(2): Suppose ἃ, is any sequence of distinct proper values of 7’. 
Given any « > 0. By Theorem 3, |,| > ¢ for at most finitely many 
n, hence there is an index N such that |A,| < ε whenever n > N. 
Thus, A, — 0. 

(3): This is clear from the proof of (1); incidentally, {μι} = 
il. 1 

Henceforth, 1,u2,43,°** denotes the sequence of distinct non-zero 
proper values of Τ', enumerated as in Theorem 5. Then, P(ux) denotes 
the projection whose range is the proper subspace ϑυγίμμ) ; P(0) is the 
projection whose range is the null space of 7’. The next results will 
show that T = ΣῚῚ uxP(ux) in a suitable sense. 

Lemma. Jf 91;,9l2,0s,--- is a sequence of closed linear subspaces of 5 
such that I; L I, whenever 7 ¥ k, the following conditions are equiv- 
alent: 

(a) The Ny are a total family. 


(Ὁ) The smallest closed linear subspace of 3 which contains every Ny 
is 70 itself. 
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(ec) visi xe # ἐξ oe expressible in the form x = ΣῚ tk 
with x, ΕΞ Ny and bE | zx ||? < ο. 


Proof. 

Let ὃ = {x Ε K: x € Ny for some k}; as shown in Cor | 
Theorem ITI.6.3, 1 = $** is the smallest closed linear ΠΝ lie 
taining every ϑῖκ. Since m+ = st++ = s+ clearly 9% = 20 if and 
only if St = {6}. This proves the equivalence of (a) and (b). 

Let P;, be the projection with range 2,. 


(a) implies (c): Suppose first that 2, & 9%} and Σ Il ze ll? <0. | 
It is clear from the Lemma of Theorem 11.7.1 that one can form the 
vector ὦ = ba tz = lim, 2 ἄρ. Since Pjr,=6@ when kx 7; 


Ρ, (=: 3) = α) whenever n > 7, hence Pjz τε lim, P; (x 4} = 23. 


Now, given any xz € δῦ, define 2, = Px. For each m, Qn = Do, Px 
is a projection (see Example V1.7.4); since the x; are mutually orthog- 
onal, 37 I] ax ll? = || Oy ae ll? = |] Dy Pee |]? = |] Que |? < If x |, 
hence D>, || 2x ||? « οὐ, Form the vector y = D0) ax. For each 7, 
Piy = 2; = Px, Pi(y — 2) = 0, y — x € Mj"; since the 0, are total, 
y~-2=-),2=y= pay rp. Since x; = P; aa ns), the uniqueness 
of such a representation is clear. 


(c) implies (a): Suppose x .1 δῖ; for all j. By (ec), write x = p ting LE 
with 2, Ε M,. Since z € N;+, 6 = Px = 2; for all 2, thus x = 9. Ι 


Definition 1. 1 the conditions of the Lemma are fulfilled, one writes 
50 == Day Hs = Ni (Ὁ 91) O Ns Φ- --. 


More generally, let 9,,%2,%3,-+- be any sequence of closed linear 
subspaces, and 9 the smallest closed linear subspace containing every 
Ny. If x 1 Ny, for all k, then x 1 M; for, writing 8 = {x}, one has 
Nz C 8 for all k, hence X C $+. It follows that 9; is a total family 
of subspaces in the Hilbert space ot. If moreover Jt; | WW, whenever 


7 # k, one has % = 1 στρ in the sense of Definition 1. It is easy to 
see that a Ty, = Wy (Ὁ do Jy. 
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Theorem 6. If μη is the sequence of distinct non-zero proper values of T, 
then 


(1) 56 = συρ(θ) Ὁ Day Hr (ue) 
= Hr*(0) (Ὁ oe Tr*(ux*) ; 


(2) Myr(0) = Rr*(0) is the null space of T and T*, and 
TGC) = TH) = Ly ϑιτίμι) = Dy eee"); 
(3) | T— ΣΙ μεΡ(μρ || -- 0, 


Ι Τ᾿ — Σὰ me*P(ux) || > 0. 


Proof. 

(1) results from Theorem 1, the Lemma, and the fact that MWra(u*) 
= ϑιγίμ). 

(2): Let = et Sr (uz). It is clear from (1) that 9 = [3tr(0)]* = 
[atp*(0)]*; thus, 9 = T*(C) = T(a) by Theorem V.8.6. 


(3): Given z € 30, one has « = P(0)z + Σὺ, P(ux)x by the proof 
of the Lemma. Then, Τὰ = 6 + ἜΣ με (μμ)Σ = lim, [Σ᾿ μρω |x 
Since z, — 0, || T — 0; μεΡίμι) || > 0 by essentially the same ar- 


gument as in Theorem 2.1. The remaining assertion follows from 
|S* |= [8}. ὃἅὕ 

Corollary 1. If T is a normal CC-operator, there exists a sequence of 
finite-dimensional operators ΤΆ such that || T — Ty || — Ὁ. 

Corollary 2. Notation as in the theorem. Given any vector y © T(3), 
there exists an expansion y = >) yx, where the y, are mutually orthog- 
onal, j | Yk I? < ον, and Ty, = μεῖ!κ- 
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Exercises 


1. Let Tx; = »,xx, with notation as in Example VII.1.5. Then, T 
is CC if and only if μη — 0. 


2. An operator 7 is CC if and only if there exists a sequence of 
finite-dimensional operators 7’, such that || 7, — T || — 0. 


3. Let T and S be operators. 

*(i) It can be shown that if 7’ is normal, and ST = TS, then ST* = 
T*S. Granted this result; if S and T are normal, and ST = TS, then 
S + T and ST are normal. 

(ii) If T and S are hyponormal, and ST* = T*S, then T — S is 
hyponormal. 

(iii) If T is normal, S is hyponormal, and ST = ΤΡ, then T — S is 
hyponormal. 


4. If S and T are normal operators, ST = 7S, and T is CC, then 
S + T and ST are normal. 


5. Let S and T be normal CC-operators, and assume, for the sake of 
simplicity, that both S and T are injective. Then, S and T are unitar- 
ily equivalent if and only if: for each scalar μ, the multiplicity of μι for 
T equals the multiplicity of μ for 5. 


6. (i) If S and T are similar operators, and 9tg() has finite dimen- 
sion n, then 9tr(x) also has finite dimension n. 

(ii) Suppose S and 7 are normal CC-operators. Assume for the sake 
of simplicity that S and T are injective. If S and T are similar, there 
exists a unitary operator U such that U*SU = 7’; that is, S and T are 
unitarily equivalent. 

*(iil) One can prove (ii) without the hypotheses of complete con- 
tinuity and injectivity. 


7. The most general normal CC-operator in 920 can be obtained as 
follows. Let 3to,9t;,22,--- be a total sequence of mutually orthogonal 
closed linear subspaces, with St, finite-dimensional for k > 1; let 
H1,M2,*** bea sequence of scalars such that 4, — 0; let μὴ = 0. There 
is a — operator 7’ such that 71 = με when x € IY, (k = 
0,1,2,---). 
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8. Let T, be a sequence of normal CC-operators such that Τ Τὰ = 
TT; for all 7 and k. Then, there exists a vector z which is proper for ; 
every Τ᾽. Appendix 
*9. Theorem 3 holds for a CC linear mapping in a normed space. 
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the Exercises 


**10. The analog of Exercise 2 for Banach space is an open ques- 
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11. Does there exist a hyponormal CC-operator which is not 
normal? 
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CHAPTER | 

§1. 2. See Example 2. 

3. See Example 7. 
§5. 6. ef = cosh ¢ + sinh ¢. 

8. Show that each vector in 3% + 9 is in [8], and vice versa. 
§6. 4. Factor theorem. 

5. (ii) See Hxercise 4.4. 
δ 7. 6. See Section 1.72 of [10] for the existence of a basis, also 


called a Hamel basis. 
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CHAPTER Il 


. See Theorem 388 of [5]. 
.c=(x—yt+y. | 
. Suppose the relation holds. Then, in the proof of Theorem 4, 


all < signs can be replaced by =; cite Exercise 2. 


. See Hzercise 4. 
. Exercise 4, and the relation x = (@ — y) + ν. 
. Assuming || x |] < 1 and || y || < 1, it suffices to show that 


\(xly)| <1. If (ely) is real, (ely) = F{l|e+y |? — 
|| « — y ||?}. In general, one can write |(x|y)| = λίαν) = 
(Ax|y) for suitable |A| = 1. 

Trye= α 


. For each t, [σι (ἢ — tn(t)| < d(tm,2n). 


See Theorem 4.1 or Exercise 3.3. 


. If |A| < 1, then {A|? < JAI. 
. || Dz ll < >, || ve ll. 
Bessel. 


. Assuming to the contrary, produce an infinite orthonormal 


sequence x, such that [21,22,03,-++] = [y1,Y2,¥a,°**] = σύ, 
and look at the vector = δὴν (1/k)zx. 

$ contains the vectors z, = (1,0,---,0,—7,0,0,---), where 
—n occurs as the n’th coordinate. 


. Gram-Schmidt. 


4, 5. See Theorems 1.4 and 1.14 of [9]. 


. See Theorems 155 and 479 of [5]. See also Section 3.22 of 


[10]. Another approach is via Theorem 4E of [6]. 


. See 814 of [3]. 


CHAPTER Ill 


(i) See Theorem 11.8.1. 

(ii) Look at = D0y (1/k)ate. 

See Fzercise 11.0.1. 

See the proof of part (3) of Theorem 4.3. 
See the proof of the Lemma to Theorem 2. 
See Theorems 2.1 and 2.2. 

See Exercise 2.3. 

Calculate 91. 


§ 5. 


ὃ 6. 


§ 8. 


δ 1. 


§ 2. 


§ 3. 


§ 4. 
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(iii) If y, is a Cauchy sequence of vectors, and ||, || --Κ 
a > 0, consider the sequence z, = || yn || yn; see Exercise 
8.1. 


. If y = Ox) CS, then) =| Doma] < ΣΟ Ιλ} = DTI: |e 


Μὴ : 
«ν (ΣΕ) =Vn iy ἢ, thus ἢ ν ἢ Σ 1/Vn. 


Clearly yo ΕΞ 8 and || yo || = 17 Vn. Quote uniqueness. 


. Look at the vectors "= (1,0,0,---), ae (3,2,0,°++), 


¥3 = (3,3) 350)° 55}, 5 5. 


. See Exercise 11.8.1. 
. Assuming to the contrary that such a vector yo € 8 exists, 


one has z — yo € 8 by the argument used in the proof 
of Theorem 1. But, x and yo belong to $+, hence x — yo 
€ δ᾽“; in particular,  — yo 1 x — yo, t = Yo € ὃ, 8 
contradiction. 


. Write t, = Yn + Zn according to Theorem 1. 
. (i) See Theorem 1.18 of [9]. 


(ii) See also Exercise 4.1. 


. Use condition (1’) to show that T(u + v) = Tu + Tv forall 


uy € HK, 
CHAPTER IV 


. Let ¢,+++,€, be the canonical basis for Θ᾽ (see Example 


1.7.1), f1,*+*+,Jm the canonical basis for @”, and express ΤῈ 
as a linear combination of the f;. 


. See Section 1.72 of [10] for the existence of a basis (also 


called Hamel basis). 


. See Lrercise 1.4. 

. See Exercise 1.8... 

. See Hzample 1.7.8. 

. Let 2,---,2, be a basis for VU, and y;,---,¥m a basis for ‘W. 


For each pair of indices j and k (7 = 1,--+-,m;k = 1,-++,n), 
there is a unique linear mapping ἔπ: 0 — W such that 
Ex, = yj and Ej,.2; = @ when i + k. The Τὴ κ are a basis 
for £(0,W). 


. There is no essential difference between this and the situa- 


tion in Hzercise 3. See also Theorem 1.3 and Exercise 1.7. 


. (ii) Suppose T is surjective. For each y € Y, let F(y) denote 


the non-empty set {x Ε 80: Τὰ = y}. The problem is to 
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21. 


define a mapping R: Y — 8. such that T(Ry) = y for all 
y Ε Y, that is, Ry € F(y) for all y € ψ. This is possible 
via the axiom of choice; see Theorem 1D of [6]. 

(iii) See Exercises 2.9 and 2.10. 


. Show there exists a non-zero vector x, in the range of Fj, 


and define a, = 4,2). 


᾿ 3 ([a,b] + [b,a]) = 6, {a,b} Ae. {b,a} = θ. 
. For example, see Exercise 4.5 and Theorem 1. 
. See the proof of Theorem 11.8.3. 
See Ezercise 2. 
. See Hxample 11.4.8, 
The uniform limit of a sequence of continuous functions is 


continuous. See Theorem 229 of [5]. 


. T is continuous by Hzample 6. Consider the real-valued 


mapping x — || 7'x ||, where || z || = 1. This is a continuous 
mapping defined on a closed and bounded subset of @”. By 
the Weierstrass-Bolzano theorem, suitably generalized to 
e", there is a vector 7, € @", || 2, || = 1, such that || Tx || > 
|| Zz, || > O whenever || x || = 1. This implies that 7? is 
continuous. See also Section 3.12 of [10]. 


. See Lxercises 12 and 3. 
. See Exercise 13 and Remark 3 following Definition IIT.4.1. 
. For the proof of completeness, see the argument used for /? 


in Example 11.6.1. 


. See the discussion of 15 in Example 11.8.1. 


19. The inequality which exhibits subadditivity is known as 
Minkowski’s inequality. See Section 21 of [8], or Section 14 
of [6]. 


. Without loss of generality, one can suppose |A,;| = -:- = 


Ιλ. = 1 and |Ay| «1 for m<k <n. Then (|| z||,)? = 
m+ Σ 41|Ae/?, thus || 2 ||) = (m+ ap)”, where ay > 0 
as Ὁ — οὐ, It follows that || x ||, — 1 = || z||,.. See also 
Theorem 24A of [6]. 

See Section 10A of [6]; the relevant paper of J. von Neumann 
and P. Jordan is cited in the bibliography of [3]. 


. (ii) Suppose yn € F, || Ym — Yn || — 0. Choose any z € Φ 


with || z || = 1. Define T,,: Φ — 5 by the formula ΤΣ = 
(x|z)yn. Clearly || Tim — Tn || = || Ym — Yn Il. 
(iii) See Exercise 10.12. 


μι δὴ oP 
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.@) If χε αὶ θὲ and x» — 2, look at || Τρ -- Τα ἢ < 


| Tt — Tre || + || Tate — Trae || + || Tate — Taz || < 
K || -- a || + || Tmte — Tare || + K || ae — z ||. See 
Theorem 2.11.4 of [4]. 


. See Theorem 2.5.5 of [4]. 

. See Mzercise 4. 

. See Hzercises 3 and 6. 

. If ὃ has dimension n, let S: Θ᾽ — & be any vector space 


isomorphism. By Fzercise 7.12, S and S~ are continuous. 
Since 7S: Θ᾽ — δ᾽ is continuous by Example 7.6, T = 
(T'S)S is continuous by Theorem 1. 


. See Hxamople 5.2. 


See Hxercise 10.12. 

See Theorem 7.6. 

See Hzercises 9.1 and 8.8. 
See Hxercise 2. 


. This is immediate from Example 2; see Theorem 2.7.4 of [4]. 
. This follows from HLxample 2; see also Theorem 8D of [6]. 


(i) See Hxercise 2 and Theorem 3. 
(ii) Part (i), and Exercise 7.12. 
(iii) See Hxereise 5. 


. (v) || 7’ || > || 7 || results from Hzample 2; see Section 4.5 


of [10]. 

Let to Ε &, || zo || = 1. By Theorem 2.7.4 of [4], there exists 
an f € &’ such that || f || = 1 and f(z) = 1. Define 
Tn: ὃ — δ᾽ by the formula T,2 = f(x)yn. Clearly 
| Tm — Tr || = l| Yn — Yn Il. 


CHAPTER V 


. Choose y € Ὁ with f(y) = 1, and look at x = f(x)y + 


[x — f(x)y). 


. In view of Theorem IV.10.2, it suffices to show that given 


any f € Φ', there is a vector y ΕΞ Φ such that f = y’. See 
the proof of Theorem 1. 


. Assume to the contrary that 9t = ὅ0, and choose a vector 


z € (5t)+ such that f(z) = 1. See the proof of Theorem 1. 
Incidentally, two linear forms having the same null space 
are necessarily scalar multiples of each other. 
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See Exercise 4. This result can be generalized to normed 
spaces; see Theorem 3.5-E of [10], or the Corollary to 
Theorem 2.6.2 of [4]. 


. Given x € @, the mapping y — (μ 1) is a continuous linear 


form on the Hilbert space 2. 


. (iii) See the Corollary of Theorem 11.0.8, 
. Given F ΕΞ 70“ ,ἡ — [F(z’)]* is a continuous linear form on 


5c. By Theorem 1, there is a vector y ΕΞ & such that 
[F(x’)]* = (z|y) for all © 3. Thus, F(z’) = (y|z) = 
μ" (4). Then, F = y"” by Theorem 1. 

See Remark 2 in the proof of Theorem 1, and the Corollary 
of Theorem I1.6.3. 


. Take Φ = & in Theorem 1; since U is isometric, and X is 


complete, U(K) is complete. 


. (i) See Exercise 1V.7.21. 


(ii) See the proof of Theorem 1; alternate proof via part (i). 


. @ If χ΄,ν' Ε 91, define (2’|y’) = (y|2). 


(ii) See Exercise 8. 


. (i) See Exercise IV.10.5. 
. (ill) See Lxercise 3. 
. See Subsection 13 of §2 in Chapter I of [7], or Theorem 


2.41-A of [10]. 
In view of Example I.1.2, it suffices to show that » + Ψ and 
Ae are bilinear. 


. (ii) See Theorem 4.2-H of [10], or the Corollary to Theorem 


2.12.1 of [4]. 

(iii) This result is known as the closed graph theorem. It 
can be deduced from (ii) by considering the mapping 
(x,Tzx) — x, which is a continuous vector space isomorphism 
of the Banach space Gr onto the Banach space &. See 
Theorem 4.2-I of [10], or Theorem 2.12.3 of [4]. 

o(Xn,Yn) τι; g(x,y) cr φίτῃ — 2Yn — y) Ἔ φία μη ΒΕ y) τ 
φία, — x,y). For the proof that (6) implies (a), see the proof 
of Theorem 17.7.8, 


ΚΗ ἃ, € M and y, € MN are Cauchy sequences, then 


¢(2n,Yn) ΕΞ ὦ isa Cauchy sequence, by an obvious modifica- 
tion of the identity given in the hint to Exercise 1. See also 
Theorem 17.7.60. 


. Say & is a Banach space. Consider the family of mappings 


8 Ξ (φῦ: ν Ε σ, || y || <1}. For each 7 € 8, || φ᾽ (4) || = 
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ΟΠΠ e@y) Il = Il ee) |] < Il ee Il ly |] < |] φ- || whenever 

.|ly || < 1. Thus, 8 is a “pointwise bounded” family of con- 
tinuous linear mappings of the Banach space ὃ into the 
normed space 9. By the principle of uniform boundedness 
(see Exercise IV.8.4), there is a constant M such that 
|| φῦ || < M whenever |! y || < 1. It follows that || ¢(x,y) || < 
M whenever || x || < 1 and || y || < 1; thus, ¢ is bounded, 
and || ¢ || < Μ. 

9. (i) See Example IV.10.2, and Section 8C of [6]. 

11. See Lzercise 10. 

12, See the proof of Theorem IV.8.2; alternate proof via Ezer- 
cise 6. 

2. (i) See Example 4 and Theorem 1.2. 

(ii) See Exercise 2.4. 
(iii) See Example 8. 

9. (i) Show that J(a + y) — (Jz + Jy) and (λυ) — τα) 
are orthogonal to J(@); by assumption, they belong to J(@). 
(iii) Show first that J is conjugate-linear and isometric. 

3. φία + yz + y) — φία — νιν — y) = 4(z,y). 

3. See Theorem 3.5 of [9]. 

6. See Exercise 4.8. 

9, See the discussion of 15 in Example 11.8.1. 

1. (i) See Hxercise 10. 

(v) The surjectivity of the correspondence is discussed in 
Exercise VITI.2.1. 

(vi) Apply the polarization identity to the sesquilinear map- 
ping (8,17) — T*S. 

13. (iii) See Theorem 111.4.1. 

14. (ii) See Hxercise 13. 

15. (i) See Hzercise 14. 

(ii) See Theorem IV .9.3. 
16. See Theorem 5 in § 24 of [7]. 


CHAPTER VI 


4, (ii) See Exercise V.8.3, 

6. (i) One can show that if 91 is a closed linear subspace of a 
normed space ὃ, y € δ, and y is not in 9, then there exists 
a continuous linear form f on & such that f(x) = 0 for all 
x € KR but f(y) + 0. See Theorem 2.7.5 of [4]. 
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(ii) See Theorem V.1.1. 


. (i) See Hxample 177.10.1. 


(ii) See Exercise II.8.1. 


. Gi) Tx = Ty if and only if Sr = Sy. 


(ii) See Theorem 2. 


. See Hxercise V5.9. 


see Nzercise V.5.9. 


. See Theorem V.8.1. 
. See Ezercise 4.9. 
. (Ὁ See Theorem 4.2. 


(ii) See Exercise 4.11. 


. See Lxercise 7.12. 
. (i) See Theorem V.7.2 and Theorem 8. 


(ii) See the proof of Theorem 3. See also Exercises V.5.8 and 
V.8.18, 


. (vii) See Theorem V.5.5. 


(ix), (x), (xi) See Section 104 of [8]. 
(xii) See Section 108 of [8]. 


. See Section 104 of [8]. 
. See Lxample 4.1. | 
. (iv) By (iii), it suffices to show that [R(3¢)]* = {6}. Indeed, 


since (Tz|x) is real, (Ra|z) = 0 implies x = @. 


. The details are similar to those in Exercise 12. See also 


Exercise 11. 


. See Theorem 11].8.1. 

. See Theorem 2 and Example 4. 

. (i) Caleulate (ΤΡ — T)*(TP — T). 
. (i) Dimension. 

. (Ὁ Look at (7*T)*(T*T). 


(ii) See Exercise 7.12. 

Τ — pl is normal. 

See Theorem V.8.6. 

See Exercises 5 and 4.8. 


. See Exercise 4.8. 
. (i) See Exercise 6. 
. T and T* are both bijective. Let R = Τοῦ, S = (T*)—. For 


all x and y, (Rx|y) = (Rx|T*Sy) = (TRx|Sy) = (x|Sy). 


. (iv) Part (iii) and Theorem V.8.6. 


(v) Apply (iv) to ΤῈ, and quote Exercise 9 part (ii). 


. ΤῊ is injective by Theorem V.8.6. Let 8 = {x € 80: ΤῊΣ || 


§ 9. 


14. 
15. 


10. 
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< 1}. Given any y € &, the numbers (x| y) are bounded as 
x varies over 8; for, supposing y = Tz, for all x Ε 8 one has 
[@ly)| = [|172}} = |(T*2|2)| < || T*x || | 2}} < || 2 I. 
It follows from the principle of uniform boundedness (see 
Exercise IV.8.4) that there is a constant K > 0 such that 
| z || < K forall z € 8. It follows that || 7'*y || > K7" || y || 
for all y € 3; this is clear if y = 6, and if y ¥ 6, then 
T*y ¥ 0, and the vector x = [ΤῊ ||—1y belongs to 8. 


. T'* is bounded below by Ezercise 12. See part (v) of Exercise 


ii; 

(ii) See part (vi) of Exercise 11. 

(vi) This is a theorem of C. R. Putnam; see the American 
Journal of Mathematics, Vol. 73 (1951), pp. 357-62. 


. (i) See Exercise 4.7. 


(ii) See the calculation in Example 9. 


. (ii) See Exercise 6.9. 


(iii) See the calculation in Example 9. | 
(i) Ife Ε 91, || R*x || = LUB {| (R*2|y)|:y C9, || y || <1}. 
Alternate proof via Exercise 4. 


11. See Exercises 4.8 and 6.9. 


12. 
14. 


See Lxercise 11. 

(i) See Exercise VII.2.6. There is also a simple argument 
using “trace,” 

(ii) 7'/S is hyponormal by Exercise 10, hence Τ' [91 is normal 
by part (i). Quote Ezercise 9. 


. Calculate T*T — TT*. 
. See Lvercise 8 and Example 8; see also Exercise 14. 
. See Example 8. 


CHAPTER Vil 


. See Example IV.7.6. 
. For example, let 7' = U + U*, where U is any operator 


such that U* is isometric and U has no proper values; for 
instance, U can be the two-sided shift operator (see Hxample 
8). Suppose Tz = ax. Then, Ux -Ε U*x = ax; applying U, 
one has UUz + z = az, (UU — aU + 1)z = 6. Suppose 
? —at+1=(¢—A)(t—u). Then (U—AI)\(U — μὴς 
= θ. It follows that ὦ = θ. 


. It is easy to see that 7’ — μ is hyponormal, for every scalar 
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p. Or, apply part (iii) of Exercise VI.9.9 to the linear sub- 
space generated by z. 


. (i) See Frercise 11.8.9. 


(iii) Let Tx, = 24, be the one-sided shift operator, and set 
z= ye a,2,. By Example 6, T has no proper values, hence 
|(Tx|zx)| < || T || by part (i). 


. See the Corollary of Theorem VJ.9.2. 
. See Theorems 2 and 3. It can be shown that if Τ' is any 


normal operator, then S7' = TS implies ST* = T*S; see 
Theorem 41.2 of [3]. 


. Clearly 9ip(u) reduces 7; let S = T* in Theorem 8. 
. (i) See Exercise 1.6. 


(ii) 7/9 and ΤῊ "δὲ are scalar operators; see also part (iii) 
of Exercise VI.9.9. 

(iii) See part (i) and the proof of Theorem 2. 

(iv) See Exercise 4. 


. There is a simple argument using “trace.’’ Another proof, 


by induction on the dimension n of 3¢, runs as follows. For 
n = 1, every operator is normal. Assume the theorem holds 
for dimensions <n. It can be shown that every linear map- 
ping in a finite-dimensional space has at least one proper 
value (see ὃ 55 of [2]). Let μ᾿ be a proper value for 7’, and let 
M = Rp(u). By Exercise 5, 9 reduces 7’, and 7'/9 is normal. 
But, στ Ὁ also reduces Τὶ and Τ 91 Ὁ is hyponormal by Ezer- 
cise VI.9.10, hence T/9* is normal by the inductive 
assumption. See Hxercise VI.9.18. 


. Induction on ἢ. | 
. | |] Ten |] — | wen |] | Φ |] Ten — μαι |I- 
. (i) T — pl is hyponormal. 


Gi) (u — »)(@n| Yn) = (Cul — T)an| yn) + @n| (T* — v*D) yn). 


. Let S = ΤῊ, where Τχκ = 2,4, is the one-sided shift oper- 


ator, and consider μ = 0. 


. See the hint for Pxercise VIII.3.1. 
. If || en || = Land || Tz, — urn || — 0, define S,2 = (x|2n) Xn, 


as in Example VI.7.2. Conversely, if the operators S,, satisfy 
condition (Ὁ), choose vectors y, with || y, || = 1 and 
| δεν || = ἢ and set tn = || Sn¥n { 'SnYn- 

Suppose || 2, || = land || T'z, — uz» || — 0. By Section 8C 
of [6], there exist continuous linear forms f, such that 
fn || = Land f,(za) = 1. Set Sx = fn(x) an. 


81. 


12. (i) See Hzercise 8. 


13. 


o ὧτ bo 


(ii) See Hzercise VI.8.9. 

(vi) TO -- κ ἢ = pl — TT, 

(vii) See Hxercise VI.6.12. 

(viii) Exercise VI.6.18. 

(ix) See Subsection 3 in § 9 of [7], or Theorem 4.7.4 of [4]. 
(x) Suppose μ is not an AP-value. By Hzercise 8, T — ul is 
bounded below, hence T — ul is invertible by part (iv) of 
Exercise VI.8.14. 

(xi) If μ € s(T), » is an AP-value by part (x), hence [μ| = 
1 by Exercise 3. 

(xii) Exercise 1.4 and parts (ii), (ix). 

Let σὰ be an orthonormal basis for 5C, and choose a sequence 
μι: A such that every » € A is the limit of a subse- 
quence of μι (see Exercises [1.8.8 and IT1.6.6). Define Tx, = 
HnXy. Every μ € A is an AP-value for T by the argument 
in Example 2. If ἃ is not in A, there is an e > 0 such that 
lun —A| > ε for all n. Clearly (7 — AJ)*(T — AD) > 2, 
hence ἃ is not an AP-value for 7’, by Exercise 8. Indeed, 
Τ' — ΧΙ is invertible by part (iv) of Exercise VI.8.14, hence 
Ais not in s(7’). 


CHAPTER VIII 


. T is an operator by Example IV.7.6. 
. See Theorem 4 and Example 8. 
. ( If || z, || = 1 and 7*7z, is convergent, then Tx, is 


Cauchy by the calculation 


Ι Tem — tn) ||? = (T*T (tm —2n)| tm — Zn) 
J 2 || T*T2y — T*T xp |\. 


See Subsection 10 of § 5 of [7]. 
(ii) (7'*)*7T* = TT* is CC by Theorem 4. 


. (i) See Example 3. 


(ii) See Exercise VI.8.12, Exercise 6, and part (i). 


. (ii) Let 91 be the n-dimensional linear subspace generated by 


the y;, and define a linear mapping Τ': σ᾽ — 20 such that 
Tyr = z (see Theorem IV.1.8). T is continuous. Define 
T = 0 on 9", and extend 7' to 50 via the relation 20 = 
9 (Ὁ N~. Then, T is a finite-dimensional operator. 
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9. 


10. 


11, 


12. 


18. 


14. 


15. 


16. 


(i) See Theorem IV .9.8. 

(ii) See Fxercise V.8.13. 

(ii) See Example 3 and Exercise 2. 

(iv) See Exercise V.8.15. 

(ii) It suffices to consider an operator with one-dimensional 
range, say Τὰ = (x|x,)z; one can assume || σι || = 1. Let δῖ 
be the null space of 7’; 91 is a separable Hilbert space (see 
Exercise II T.6.5), hence has an orthonormal basis 15,15," “". 
Then, 21,%2,%3,°:* is an orthonormal basis for 3, and 
Lin |! Τὰς [12 = || Tx: |I?. 

Let Rk € 5, R # 0. If zis a vector such that Rz ~ 6, and A 
is the operator Ax = (z|z)z, clearly J = RA is an operator 
with one-dimensional range, and Τ € g. One has 71 = 
(x|z)y, where y = Rz. Suppose S is any other operator with 
one-dimensional range; by Evercise VI.7.14, Sx = (a! 2,)y; 
for suitable vectors ¥;,2,. By Exercise 8, there exist operators 
B and C such that Bz = z, and Cy = y,. Then S = CTB*, 
hence S € g. Thus, 9 contains every operator with one- 
dimensional range. See the proof of Theorem 8. 

If || Tz, || >n, Tz, cannot have a convergent subse- 
quence. 

(i) See Theorems 1 and 2. 

(ii) See Theorem 4. 

(iii) See the proof of Theorem ὅ. 

(v) See Hxercise IV .7.12. 

(vi) If & is infinite-dimensional, one can construct an infinite 
sequence x, such that || 2, || = 1 and || 2 — 2, || > 4 
whenever m τέ ἢ; see item V in Subsection 1 of §4 
of [7]. 

If & has dimension n, let S: Θ᾽ - & be any vector space 
isomorphism ; 8 and S~' are continuous by Fzercise IV.7.12, 
and 77'S: ©" — § is continuous by Hxample IV.7.6. If 
tm ΕΞ &and |! x, || < 1,S7'z, has a convergent subsequence 
(by the complete continuity of J: eC? — @”), hence so does 


(T'S) (S~*x,). 
(i) If || «|| = 1, zn Ε MN, and z, — z, then || z, || > 1; set 
Ya = I en "Be: 


(ii) Ifz, ΕΞ ὃ and | Zn || < 1, choose yn € 91 with lyn || = 
|| &n || and || yn — an || < 1/n. 
see Theorem 2.13.5 of [4]. 


" 

.] 
ἡ | 
¢ » 


§2. 1. 


2. 
§3. 1. 


Appendix 20] 
Let Τί, be the unique operator such that T,2, = >." 


+ ae | Niky 
for all k. Then, 7’, is finite-dimensional and |! T,,, — T, —>0. 
Observe that Σ᾽... |ajx|? < ©, and quote Exercise 1. 

(i) See Theorem 8. 

(ii) By Theorem 26A of [6], there exists a scalar » in 
the spectrum of 7 such that |u| = || T ||. By part (x) of 
oe VII.3.12, w is an AP-value. Quote Theorem 

.«Ὁ.1. 


4. Let u be a scalar in the spectrum of 7' with |x| as large as 


“I o> Oo 


possible (see part (ix) of Ezercise VII.3.12). Let \» be any 
sequence of scalars such that |A,| > [μ| andA, — μ. De- 
fining k, = T — d,J and R = T — yl, one has || R, — R || 
— 0; the &, are invertible, and F isn’t. We assert that μι is 
an AP-value for T. Assume to the contrary; by Exercise 
VII.3.10 there exists an e > 0 such that || RS || > ε when- 
ever || S || = 1. Then, || RS || > « || S || for every operator S. 
Fix any index m such that || R,, — R || < ¢«/2. For every 
operator S, εἰ 5 || < || RS|| = || (R — R,)S + RS || < 
| (ἢ — Ry )S || + || RS || < || R — Rm |} || S|] + 1 RnS || < 
¢/2 || S || + || R,S ||. In particular, for S = R,,—!, one has 
€ || Rm? || S €/2 |] Ba || + 1, hence ¢/2 < || Rpm! {τὰ 
Then || μα — BR || < ἐῶ < || BR, ||. It follows that 
fe is invertible (see the proof of Theorem 4.3.2 of [4]), a 
oe See also the Corollary of Theorem 4.11.1 
of [4]. 


. See § 55 of [2]. 
. See Theorem 5.5-C of [10]. 
. For any operator T, lim || 7” ||!/" = LUB { |u|: € s(T)}, 


where s(7') is the spectrum of 7’ (see Theorem 4.7.3 of [4]), 
hence the equivalence of (Ὁ) and (c) is clear. 

(b) implies (a) by part (i) of Exercise VII.3.12, and Exer- 
cise 4. 

(a) implies (b) is implicit in the hint given for Exercise 4. 
It should be mentioned that the usual definition of “gen- 
eralized nilpotent” is condition (c). If Τ' is normal, it can be 
_ that lim || 7” ||'/" = || T || (see Theorem 4.12.1 of 


. Since S has no proper values, by Theorem 2 it cannot have a 


non-zero AP-value. Incidentally, || Sz, || = 1/k — 0 shows 
that 0 is an AP-value. 
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§4. 2. See the Corollary of Theorem 1.5 for the “if”? part. Con- 


versely if 7 is CC, then ΤῊ is CC by Exercise 1.6; consider 
the Cartesian form 7 = A + 7B. 


. (i) See the hint to Exercise VII.2.3. 


(ii) Show that (7 — S)(T — S)* < (Ὁ —S)*(T — 5). 
(iii) Parts (i) and (ii). 


. See Theorem 2. 
. (iii) The reference for this is given in the hint to Lxercise 


VI.8.16. 


. This is clear if 7. = 0 for all k. Assume otherwise, say 7, γέ 


0. Let u be a non-zero proper value for 7',, and consider Xt = 
σύν, (μ). N is invariant under each Ty. Since moreover 
T,.*T, = T,T,*, 9% reduces each Τὶ. . 

Among the finite-dimensional linear subspaces, different 
from {6}, which reduce every Τ᾽, let 9 be one of smallest 
possible dimension. Let Ry = 7;,/2. It suffices to show that 
every R; is a scalar operator. Assume to the contrary, say, 
that R; is not scalar. Let ἃ be a proper value of ὦ}. By 
assumption, 9p;(A) γέ ὅπ. But, the earlier argument shows 
that 91p;(A) reduces each Rx, hence each Τὰ (because already 
91% reduces each Τ᾽); this contradicts the minimality of the 
dimension of SM. 


9. See Theorem 5.5-G of [10]. 
10. See the Remark preceding Theorem 2.13.4 of [4]. 
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best approximation, 46 
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bounded below, 155 
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bounded bilinear mapping, 120, 122 

bounded function, 99 

bounded linear mapping, 95 
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bounded sesquilinear mapping, 128 
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Cc", 26 

canonical basis, 21 
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Cartesian form of an operator, 148 
Cartesian product, 116 
Cauchy-Schwarz inequality, 30, 126 
Cauchy sequence, 34, 39, 93 
Cayley transform, 150 
CC-operator, 172 

center, 38 

classical Hilbert space, 56 

closed ball, 38, 40, 98 

closed graph theorem, 194 


closed linear subspace, 60, 62, 64, 66, 


71, 94, 140 
closed subset, 59, 63, 66, 91 
closure, 62, 63, 64, 71 
coefficient, 11 
commutative algebra, 89 
commutative law, 3 
complete linear subspace, 66 


complete metric space, 34, 65, 66, 


97 
complete subset, 65, 66 
completely continuous operator, 172 
completion, 112, 115 
complex conjugate, 25 


complex-conjugate of a vector space, 


124 
complex vector space, 3, 24 
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component, 4 

composite of mappings, 86, 88, 103 

conjugate, 25 

conjugate-homogeneous, 81 

conjugate-linear, 81 

conjugation, 127, 128, 146 

continuous linear form, 105, 109, 110 

continuous linear mapping, 94, 96, 98, 
99, 120, 139 

continuous mapping, 91, 94 

contraction, 150 

convergent sequence, 34, 39, 91, 93 

convex set, 67 

coordinates, 116 

correspondence, one-to-one, 74 


diagonal procedure, 174 

difference of vectors, 7 

dimension, 23, 178 

direct image, 80 

discrete metric space, 33, 38, 39 

distance, 31, 33, 93 

distributive laws, 4, 89 

dual space of a normed space, 105, 
109 


element, 57 

empty set, 58 

equal mappings, 73, 117, 140 
equal sets, 58 


final set, 73 

finite-dimensional operator, 173, 174, 
175, 176, 183 

finite-dimensional vector space, 23, 
48, 51, 96, 99, 161, 175, 176, 179, 
181 

finite multiplicity, 178 

finitely generated vector space, 21 

finitely non-zero sequences, 5, 26, 35, 
69, 111 

Fourier coefficients, 47 

function, 73 


Gaussian rationals, 24 

generalized nilpotent operator, 181, 
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generated linear subspace, 15, 16 

generators, system of, 15 

Gram-Schmidt procedure, 47, 72, 
178 

graph, 117, 119, 120, 194 


Hahn-Banach theorem, 105 

Hermitian operator, 147 

Hermitian sesquilinear form, 126, 
129 

Hilbert-Schmidt class, operator of, 
136, 177 

Hilbert space, 40, 56 

homogeneous mapping, 77 

hyponormal operator, 161, 162, 166, 
168, 170, 175, 187, 188 


ideal, 175, 176 

identity mapping, 78, 87 

identity operator, 139, 178, 175, 
176 

inclusion, 57, 58 

infinite-dimensional vector space, 23 

infinite multiplicity, 178 

infinite sums, 49, 50, 52, 135, 136, 
143, 184, 185 

initial set, 73 

injective mapping, 73, 80 

invariant subspace, 156, 162, 166, 
182 

inverse image, 80, 84, 91 

inverse mapping, 83, 84, 87, 88, 98, 
99, 120, 155, 156, 161 

invertible operator, 155, 156, 161, 
162, 170, 171, 175 

involution, 137, 138 

isometric linear mapping, 97, 144 

isometric mapping, 97 

isometric operator, 140, 148, 146, 153, 
160, 161, 162 


isomorphic Hilbert spaces, 55, 56, 146 
isomorphie vector spaces, 82, 83, 88 
isomorphism, 55, 82, 88 


13, 40 

left-inverse, 88 

left-invertible, 88 

limit, 34, 39 

linear combination, 11 

linear dependence, 15, 16 

linear form, 104, 109, 110 

linear independence, 18, 19, 44, 47, 
168 

linear mapping, 77 

linear subspace, 13 


p-space, 166 

mapping, 73 

matrix, 82, 134, 177 

member, 57 

metric space, 33, 93 

metrically equivalent operators, 144, 
154, 155, 175 

minimizing vector, 68, 69, 72 

Minkowski’s inequality, 192 

multiplicity, 178, 187 


n-fold transitive, 175 

n-ples, 4, 26 

non-empty set, 58 

norm, 28, 93, 95, 121, 128 

normal operator, 154, 181 

normed algebra, 103 

normed *-algebra, 137 

normed space, 93 

null mapping, 77 

null space of a linear mapping, 80, 94, 
140 


one-sided shift operator, 144, 159, 
162, 165, 170, 171, 178, 198 

one-to-one correspondence, 74, 83 

one-to-one mapping, 73, 80 

onto mapping, 73, 88 

open ball, 38, 40, 93 


Index 205 


operator, 139 

ordered pair, 116 

orthogonal complement, 71 
orthogonal! linear subspaces, 61, 66 
orthogonal projection, 74 
orthogonal set, 43 

orthogonal subsets, 61 

orthogonal vectors, 43 
orthonormal basis, 52, 54 
orthonormal set, 44 


parallelogram law, 29, 100, 114, 127 
partially isometric operator, 153 
point, 33 
polarization identity, 29, 125 
positive operator, 149 
positive sesquilinear form, 126 
pre-Hilbert space, 25 

of continuous functions, 26 

of finitely non-zero sequences, 26 
projection, 74, 75, 151 

operator, 140, 151 
proper subspace, 166 
proper value, 163 
proper vector, 163 
Pythagorean relation, 44, 48, 50, 

62 


radius, 38, 40, 93 

range, 73, 80 

reducing subspace, 158 

reflexive space, 107, 111, 114, 123 
restriction, 158 

Riesz-F rechet theorem, 109 
right-inverse, 88 

right-invertible, 88 


scalar, 3, 24 
mapping, 78 
multiple, 4 
multiple of a linear mapping, 85 
operator, 139 
product, 25 
self-adjoint operator, 147, 170, 179 


milinear mapping, 81 
separable Hilbert space, 53, 54, 55 
separable metric space, 54 
sesquilinear form, 123 
sesquilinear mapping, 123 
set, 57 
similar operators, 156, 187 
skew-adjoint operator, 148 
spectral theorem, 181 
spectrum of an operator, 171 
sphere, 40 
square root of a positive operator, 

149 
Steinitz’s exchange theorem, 21 
strong convergence, 102, 108 
subalgebra, 137 
*-subalgebra, 137, 175 
subset, 57 
sum of linear mappings, 85 
sum of linear subspaces, 14 
sum of vectors, 3, 31 
superset, 58 
surjective mapping, 73, 175 
system of generators, 15 


total family of subspaces, 167, 181 


total sequence, 51 

total set, 51 

transformation, 73 

triangle inequality, 30, 33 

two-sided shift operator, 146, 151, 
160, 161, 165, 197 


uniform boundedness principle, 102, 
195, 197 

uniform convergence, 38, 102 

unitarily equivalent operators, 147, 
149, 156, 187 

unitary operator, 145 

unitary space, 26 


value, 73 

vector, 3 
space, 3 
sum, 3, 31 


weak convergence, 108, 111, 114 
Weierstrass-Bolzano theorem, 192 


zero mapping, 77 
zero operator, 139 
zero vector, 3 
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